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^ (54) Title: NUCLEIC ACIDS WHICH CODE FOR THE ENZYME ACTIVITIES OF THE SPINOSYN BIOSYNTHESIS 

g (54) Bezeichnung: NUCLEINSAUREN, DIE FUR ENZYMAKTTVITATEN DER SPINOSYN-BIOSYNTHESE CODIEREN 

^ (57) Abstract: The present invention relates to nucleic acids which code for the enzyme activities of the spinosyn biosynthesis. 
^ The invention also relates to the corresponding enzymes as such. The invention further relates to a method for producing spinosyn 
^ derivatives and spinosyn precursors. 

o 

Q (57) Zusammenfassung: Die vorliegende Erfindung betrifft Nucleinsauren, die fur Enzymaktivitaten der Spinosyn-Biosynthese co- 
dieren, sowie die entsprechenden Enzyme per se. Weiterhin betrifft die Erfindung Verfahren zum Herstellen von Spinosyn-Derivaten 
und -Vorstufen. 
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Nucleinsauren. die fur Enzymaktivitaten der Spinosvn-Biosvnthese codieren 

Die vorliegende Erfindung betriffi Nucleinsauren, die fur Enzymaktivitaten der 
Spinosyn-Biosynthese codieren, sowie die entsprechenden Enzyme per se. 

5 

Spinosyne stellen eine neue Gruppe von makrolidischen Verbindungen dar, die aus 
dem Actinomyceten Saccharopolyspora spinosa isoliert worden sind (Mertz und 
Yao, 1990). Sie werden zur Bekampfiing von Insekten eingesetzt (WO 97/00265, 
WO 94/20518, WO 93/09126, US 5670364, US 5362634, US 5227295, US 
10 5202242). Spinosyne zeigen eine starke insektizide, jedoch keine antibakterielle 
Aktivitat, wodurch sie von den konventionellen Makroliden, wie Tylosin, 
Spiramycin und Erythromycin, die keine insektizide, jedoch antimikrobielle Wirk- 
samkeit aufweisen, unterscheidbar sind. 

15 Die Struktur der Spinosyne setzt sich zusammen aus einem tetracyclischen Poly- 
ketidgrundgertist (Aglycon) mit einem 12-gliedrigen Makrolidring und einem 5,6,5- 
cis-anti-trans-Tricyclus, sowie einem D-Forosamin- und einem 2,3,4-Tri-O-Methyl- 
L-Rhamnose-Zuckeranteil (Kirst et al., 1991). Mehr als 20 verschiedene natiirliche 
Spinosyn-Derivate, der sogenannte A83543 Komplex, ist bisher beschrieben worden 

20 (WO 97/00265, WO 94/20518, WO 93/09126). Diese Derivate variieren in der Sub- 
stitution von einer oder einigen Methylgruppen am tetracyclischen Grundgeriist, am 
Forosamin- oder am Tri-Methyl-Rhamnose-Zuckeranteil. Ein 17-Pseudoaglycon, 
dem der Forosamin-Zuckeranteil fehlt, ist ebenfalls aus Kulturbriihen von S. spinosa 
isoliert worden. 

25 

Die Hauptkomponenten des von S. spinosa gebildeten A83543 Komplexes stellen die 
Varianten Spinosyn A und Spinosyn D dar, die die wesentlichen Bestandteile des 
Produktes Spinosad darstellen (vgl. Pesticide Manual, British Crop Protection 
Council, 11 th Ed., 1997, Seite 1272 und Dow Elanco trade maganzine Down to Earth, 
30 Vol. 52, NO:. 1, 1997 und die darin zitierte Literatur). 
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Aufbauend auf Untersuchungen zum Einbau von C 13 -markiertem Acetat, Propionat, 
Butyrat oder Isobutyrat konnte gezeigt werden, dass die Biosynthese von A83543 
einem Polyketid-Biosyntheseweg folgt (Nakatsukasa et al, 1990). Polyketide werden 
durch multifunktionelle Enzyme, den sog. Polyketidsynthasen (PKS's) aus kurz- 
5 kettigen Saurebausteinen wie Acetat, Propionat oder Butyrat aufgebaut. Ahnlich wie 
die verwandten Fettsauresynthasen (FAS's) katalysieren sie decarboxylierende Poly- 
kondensationsschritte der als CoA-Thioester aktivierten Bausteine. Wahrend FAS's 
nach jedem Kondensationsschritt eine vollstSndige Reduktion der intermediar an der 
wachsenden Polyketidkette entstehenden p-Oxoester durch Ketoreduktion, Dehy- 
10 dratation und Enoylreduktion katalysieren, konnen PKS's bestimmte Reduktions- 
schritte auslassen. Modulare Typ I PKS's bestehen aus einem oder mebreren groBen 
multifunktionalen Proteinen. Iterative Typ II PKS's stellen dagegen einen Komplex 
aus weitgehend monofunktionalen Proteinen dar. 

15 Die enzymatischen Aktivitaten von modularen Typ I PKS's lassen sich zu soge- 
nannten Modulen zusammenfassen. Hierbei tragt ein Modul eine Anordnung von drei 
enzymkatalytisch aktiven Domanen, die zu einer Verlangerung der wachsenden Poly- 
ketidkette urn eine biosynthetische Verlangerungseinheit fuhren. Bei diesen 
Domanen handelt es sich urn eine P-Ketoacyl:Acyl Carrier Protein Synthase- 

20 Domane, eine Acyltransferase-Domane und eine p-Ketoacyl:Acyl Carrier Protein- 
DomSne. Ein Modul kann auch eine Ketoreduktase-, eine Dehydratase-, eine Enoyl- 
reduktase- und eine Thioesterase-Domane tragen. Ein sog. Ladungsmodul, das am 
Beginn der Biosynthese steht kann von den genannten Domanen lediglich eine Acyl- 
transferase-Domane und eine p-Ketoacyl:Acyl-Cairier Protein-Domane tragen, sowie 

25 eine enzymatisch inaktive p-Ketoacyl:Acyl Carrier Protein Synthase-Domane. Eine 
Polyketidsynthase-Domane umfasst jeweils eine dieser genannten enzymatischen 
Aktivitaten. 

Aufgrund der potenten insektiziden Wirkung sowie der bemerkenswerten Struktur 
30 der Spinosyne besteht ein groBes Interesse, die genetischen Informationen ftlr deren 
Biosynthese zu entschliisseln. 
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Gegenstand der Erfindung sind Nucleinsauren, welche zumindest eine Region um- 
fassen, die ftir eine Enzymaktivitat codiert, welche an der Biosynthese von 
Spinosynen beteiligt ist 

5 

Die vorliegende Erfindung stellt ein Cluster von offenen Leserahmen (ORF's) bereit, 
deren Translationsprodukte an der Biosynthese von Spinosynen beteiligt sind. 
Weiterhin werden zusatzliche Gene bzw. ORF's bereitgestellt, die auBerhalb des ca. 
120 kb groBen Spinosyn-Biosyntheseclusters liegen, und deren Translationsprodukte 
10 an der Rhamnose-Zuckerbiosynthese beteiligt sind. 

Bei den erfindungsgemaBen Nucleinsauren handelt es sich insbesondere urn einzel- 
strangige oder doppelstrangige Desoxyribonucleinsauren (DNA) oder Ribonuclein- 
sauren (RNA). Bevorzugte Ausfiihrungsformen sind Fragmente genomischer DNA 
15 und cDNA's. 

Der Ausdruck "zumindest eine Region", wie er hierin verwendet wird, bedeutet, dass 
die erfindungsgemaBe Nucleinsaure eine oder mehrere Sequenzen umfassen kann, 
welche jeweils fiir einzelne Aktivitaten codieren, die Schritte bei der Synthese von 
20 Spinosynen durchfuhren. Es werden demnach auch Nucleinsauren als erfindungsge- 
maB betrachtet, die nur fur eine einzige Enzymaktivitat der Spinosyn-Biosynthese 
codieren. 

Der Ausdruck "Enzymaktivitat", wie er hierin verwendet wird, bedeutet, dass 
25 ausgehend von den hierin betrachteten Nucleinsauren zumindest derjeriige Teil eines 
vollstandigen Enzyms exprimiert werden kann, der noch die Katalyseeigenschaften 
des Enzyms ausubt. 

Insbesondere codieren die erfindungsgemaBen Nucleinsauren fiir Enzymaktivitaten 
30 von Polyketidsynthasen, Methyltransferasen, Epimerasen, Glycosyltransferasen, 
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Aminotransferasen, Dimethyltransferasen, Reduktasen, Dehydratasen und/oder 
Cyclisierungsenzymen. 

Bevorzugt handelt es sich bei den erfindungsgemaBen Nucleinsauren urn DNA- 
5 Fragmente, die genomischer DNA von S. spinosa entsprechen. 

Besonders bevorzugt umfassen die erfindungsgemaBen Nucleinsauren zumindest 
eine Sequenz ausgewahlt aus 

10 (a) den Sequenzen gemaB SEQ ID NOS: 1, 2, 3, 4, 5, 7, 9, 11, 13, 15, 17, 

19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 52 
oder 54, 

(b) zumindest 14 Basenpaare langen Teilsequenzen der unter (a) defi- 
1 5 nierten Sequenzen, 

(c) Sequenzen, welche an die unter (a) definierten Sequenzen hybri- 
disieren, 

20 (d) Sequenzen, welche eine zumindest 70 %ige, bevorzugt eine 80 %ige, 

besonders bevorzugt eine 90 %ige Identitat zu den unter (a) definier- 
ten Sequenzen aufweisen, 

(e) Sequenzen, welche zu den unter (a) definierten Sequenzen komple- 
25 mentar sind, und 



(0 



Sequenzen, welche aufgrund der Degeneriertheit des genetischen 
Codes fur dieselbe Aminosauresequenz kodieren wie die unter (a) bis 
(d) definierten Sequenzen. 
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Der Ausdruck "hybridisieren", wie er hierin venvendet wird, beschreibt den Vor- 
gang, bei welchem ein einzelstrangiges Nucleinsauremolekiil mit einem komple- 
mentaren Strang eine Basenpaarung eingeht. Auf diese Weise k6nnen beispielsweise 
ausgehend von genomischer DNA aus Organismen, die phylogenetisch mit S. 
5 spinosa verwandt sind und die Fahigkeit der Biosynthese von Spinosynen besitzen, 
DNA-Fragmente isoliert werden, welche dieselben Eigenschaften wie die aus S. 
spinosa isolierten Fragmente aufweisen. 

Bevorzugte Hybridisierungsbedingungen sind nachstehend angegeben: Hybridisie- 
10 rungsldsung: 5 x SSC; Blocking Reagens (Roche Diagnostics GmbH, Mannheim, 
Deutschland), 1 %; N-Lauroylsarcosin, 0,1 %; SDS (Sodiumdodecylsulfate) 0,02 %; 
Hybridisierungstemperatur: 60°C; erster Waschschritt: 2 x SSC bei 60°C; zweiter 
Waschschritt: 2 x SSC bei 60°C; bevorzugt zweiter Waschschritt: 0,5 x SSC bei 
60°C; besonders bevorzugt zweiter Waschschritt: 0,2 x SSC bei 60°C. 

15 

Der Grad der Identity der Nucleins&uren wird vorzugsweise bestimmt mit Hilfe des 
Programms GAP aus dem Programmpaket GCG (Devereux et al., 1984), Version 9.1 
unter Standardeinstellungen. 

20 Besonders hervorgehoben werden NucleinsSuren, die 

(1) entweder alle Sequenzen, die fur Schritte der Forosamin- und Trimethyl- 
Rhamnose-Biosynthese codieren, umfassen, insbesondere die Sequenzen ge- 
mSB SEQ ID NOS: 4 und 51, oder 
25 (2) alle Sequenzen, die fur Schritte der Polyketidsynthese codieren, umfassen, 
. insbesondere die Sequenzen gemaB SEQ ID NOS: 5 und 6, oder 
(3) alle Sequenzen, die flir alle Schritte der Forosamin-, Trimethyl-Rhamnose- 
und Polyketidsynthese codieren, umfassen, insbesondere die Sequenzen ge- 
maB SEQ ID NOS: 1, 2, 3 und 51. 
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Alle zur Spinosyn-Biosynthese oder zur Synthese von Vorstufen, wie sie nach- 

r 

stehend definiert sind, benotigten DNA-Sequenzen konnen sich somit auf einem 
einzelnen Vektor befinden. Diese Nucleinsauren konnen aber auch auf zwei oder 
mehreren Vektoren vorliegen und gleichzeitig oder nacheinander in einer Wirtszelle 
5 exprimiert werden. 

Alle ORF's der erfindungsgemaBen Nucleinsauren konnen von ihren eigenen Pro- 
motoren oder von heterologen Promotoren angeschaltet werden. 

10 Gegenstand der vorliegenden Erfindung sind auch die regulatorischen Regionen, 
welche natOrlicherweise, d.h. im Ursprungsorganismus S. spinosa, die Transkription 
der erfindungsgemaBen Nucleinsauren kontrollieren. 

Der Ausdruck "regulatorische Regionen", wie er hierin verwendet wird, bezieht sich 
15 auf Promotoren, Repressor- oder Aktivator-Bindungsstellen, Repressor- oder Akti- 
vatorsequenzen, und Teiminatoren. Femer sind genetisch mobile Elemente, welche 
natiirlicherweise, d.h. im Ursprungsorganismus S. spinosa vorkommen, ebenfalls von 
diesem Ausdruck imifasst. Solche genetisch mobilen Elemente konnen transposable 
oder mobilisierbare Elemente oder funktionelle Teile davon, IS-Elemente oder an- 
20 dere Insertionselemente sein. Weiterhin sind auch amplifizierbare DNA-Elemente 
(Amplifiable Units of DNA, AUD; Fishman and Hershberger, 1983), welche natiir- 
licherweise, d.h. im Ursprungsorganismus S. spinosa vorkommen, von diesem Aus- 
druck umfasst. Die Erfindung betriffl auch jede {Combination dieser regulatorischen 
Regionen untereinander oder mit heterologen DNA-Fragmenten, wie z.B. Promo- 
25 toren, Repressor oder Aktivator-Bindungsstellen, transposablen, mobilisierbaren oder 
transduzierbaren Elementen. 

Gegenstand der vorliegenden Erfindung sind weiterhin DNA-Konstrukte, die zu- 
mindest eine erfindungsgemaBe Nucleinsaure und einen heterologen Promotor um- 
30 fassen. 
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Der Ausdruck "heterologer Promotor", wie er hierin verwendet wird, bezieht sich auf 
einen Promotor, der im Ursprungsorganismus nicht die Expression des betreffenden 
Gens (ORF's) kontrolliert. 

5 Die Auswahl von heterologen Promotoren ist davon abhangig, ob zur Expression 
pro- oder eukaryotische Zellen oder zellfreie Systeme verwendet werden. Ein bevor- 
zugtes Beispiele fiir einen heterologen Promotor ist der Promotor des /ne/-Gens aus 
dem Vektor pIJ702 (The John Innes Foundation, Norwich, UK 1985). Die heterologe 
Expression kann z.B. eingesetzt werden, urn zu einer Steigerung der Produktion von 
10 Spinosyn im Vergleich zum natiirlichen Spinosyn-Produzenten zu gelangen. 

Gegenstand der Erfindung sind ferner Vektoren, die zumindest eine der erfindungs- 
gemaBen Nucleinsauren enthalten. Als Vektoren konnen alle in molekularbio- 
logischen Laboratorien verwendeten Phagen, Plasmide, Phagmide, Phasmide, 
15 Cosraide, YACs, BACs, PACs, kunstliche Chromosomen oder Partikel, die fiir einen 
Partikelbeschuss geeignet sind, verwendet werden. 

Bevorzugt sind BAC- Vektoren. BAC- Vektoren (Bacterial Artificial Chromosome) 
sind entwickelt worden zur Klonierung von groBen DNA-Fragmenten (Shizuya et aL, 

20 1992). Es handelt sich urn "single-copy" Plasmide mit einem F-Faktor Origin, die 
DNA-Fragmente mit einer durchschnittlichen GroBe von 120 Kilobasenpaaren (kb) 
tragen konnen. Sie sind replizierbar in Escherichia coli. Der BAC-Vektor 
pBeloBACll (Kim et al., 1996) tragt einen T7 und einen SP6 Promotor, welche die 
Klonierungsstelle flankieren und als Startbereich fur Sequenzierungsprimer sowie 

25 zur Generierung von RNA-Transkripten verwendet werden konnen. 

Besonders bevorzugt sind die am 18. August 1999 bei der Deutschen Sammlung von 
Mikroorganismen und Zellkulturen GmbH (DSMZ), Mascheroder Weg lb, D-38124 
Braunschweig, in Ubereinstimmung mit den Bestimmungen des Budapester Ver- 
30 trages unter den Hinterlegungsnummern DSM 13010, DSM 1301 1 und DSM 13012 
hinterlegten BAC-Shuttleklone, die Gegenstand der vorliegenden Erfindung sind. 
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Die hinterlegten BAC-Shuttleklone PI 1/G6, P8/G1 1 und P11/B10 tragen jeweils ein 
mindestens 100 kb groBes DNA-Fragment aus S. spinosa. Die Klone P11/G6 und 
P11/B10 txagen jeweils einen Teil der Nucleinsauresequenz gemaB SEQ ID NO: 4, 
sowie die angrenzenden vollstandigen Nucleinsauresequenzen gemaB SEQ ID NOS: 
5 und 6, sowie einen an die Nucleotidsequenz gemaB SEQ ID NO: 6 3'angrenzende 
DNA-Bereich (Abb. 7). Der Klon P8/G11 tragt einen Teil der Nucleinsauresequenz 
gemaB SEQ ID NO: 6, die vollstandigen Nucleinsauresequenzen gemaB SEQ ID 
NOS: 5 und 4, sowie einen an die Sequenz gemaB SEQ ID NO: 4 3 '-angrenzenden 
DNA-Bereich (Abb. 7). 

In gleicher Weise sind auch PAC- und alle anderen funktionell gleichwertigen 
Vektoren, die es erlauben, groBe DNA-Fragmente, insbesondere solche DNA- 
Fragmente, die groBer als 30 kb, voizugsweise groBer als 40 kb, besonders bevorzugt 
groBer als 60 kb sind, in heterologe Wirtszellen zu ubertragen und dort eine 
Etablierung von Fremd-DNA zu gewahrleisten, fiir eine Spinosyn-Produktion ge- 
eignet. Vorzugsweise werden solche BAC-, PAC- und funktionell gleichwertigen 
Vektoren venvendet, die zu einem Shuttle- Vektor modifiziert sind und z.B. eine 
Plasmidreplikation sowohl in Gram-negativen Bakterien, wie Escherichia coli, als 
auch in Gram-positiven Bakterien, wie Streptomyces, erlauben. Solche bevorzugten 
Shuttle- Vektoren konnen DNA-Fragmente einer GrSfle tragen, die in ublichen 
Vektoren, wie z.B. Cosmidvektoren, nicht klonierbar sind und nicht in heterologe 
Wirte, wie Actinomyceten, z.B. Streptomyceten, Qbertragbar sind. Letztere Vektoren 
ktinnen sowohl durch Transformation, Konjugation, Elektroporation, Protoplasten- 
transformation oder andere geeignete Verfahren ubertragen • werden. In hervor- 
ragender Weise sind solche Shuttle- Vektoren innerhalb einer heterologen Population 
Gram-negativer oder Gram-positiver Bakterien, zwischen Gram-positiven und Gram- 
negativen Bakterien, zwischen Bakterien und Archea, zwischen Pro- und 
Eukaryonten konjugativ ubertragbar. Die in heterologe Wirte, wie z.B. 
Streptomyceten, Ubertragenen BAC-, PAC- oder funktionell gleichwertigen Shuttle- 
Vektoren konnen autonom repliziert werden oder ins Genom des Wirtes integriert 
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werden. Letztere Integration kann iiber homologe Rekombination, iiber einen OC31- 
Integrationsmechanismus (Hopwood et al, 1985), tiber ortsspezifische Integration, 
die von pSAM2 (Smokvina et al, 1990; WO 95/16046) determinierten Funktionen 
abhangt oder tiber Mini-Circle vennittelte Funktionen (Motamedi et al., 1995; 
5 WO 96/00282) erfolgen. 

Solche Shuttle- Vektoren erlauben es, spezifische, durch auBerordentlich groBe DNA- 
Bereiche determinierte Biosynthesewege von PrimSr- oder SekundSrmetaboliten, 
durch Transfer eines einzigen rekombinanten Vektors heterolog in besonders ge- 

10 eigneten Wirtszellen zii exprimieren. So kann das identifizierte Cluster fiir die Bio- 
synthese von Spinosyn in Organismen, wie Actinomyceten, z.B. Streptomyces, durch 
Transfer eines einzigen rekombinanten Shuttle-Vektors ausgeprSgt werden. Aufgrund 
der GroBe dieses Biosyntheseclusters ist diese heterologe Expression der Spinosyn- 
Biosynthese mit einem einzigen Cosmidvektor nicht moglich. Die Ubertragung eines 

15 rekombinanten BAC-, PAC- oder funktionell gleichwertigen Shuttle-Vektors, der die 
erfindungsgemaBen NucleinsSuren tragt, kann zu einer signifikanten Steigerung der 
Produktion von Spinosyn im Vergleich zur Spinosyn-Produktion des Stammes S. 
spinosa oder abgeleiteter Mutanten mit erhohter Spinosyn-Bildung ffihren. Zudem 
kann ein solcher, fur die Spinosyn-Biosynthese codierender Shuttle-Vektor genutzt 

20 werden, urn nach Ubertragung in heterologe Wirtszellen deren Biosynthese- und 
Modifizierungsleistung auszunutzen, urn zu einer signifikanten Modifizierung von 
Spinosyn oder Spinosyn-Biosynthesevorstufen zu gelangen. Hierdurch ist es zudem 
moglich, neue Spinosyn-Derivate durch den Transfer eines einzigen rekombinanten 
Vektors in heterologe Wirtszellen herzustellen. 

25 

Weiterhin konnen solche Shuttle- Vektoren verwendet werden, klonierte Biosynthese- 
wege von Sekundarmetaboliten als Bestandteil eines einzigen rekombinanten 
Shuttle-Vektors genetisch zu modifizieren. Solche Modifizierungen konnen z.B. in 
einem £ coli Wirt durchgefiihrt werden, z.B. unter Ausnutzung von Re- 
30 kombinationsereignissen unter Beteiligung des recA-Genproduktes oder der recE- 
und recT-Genprodukte (Muyrers et aL, 1999). Weiterhin konnen solche Vektoren 
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durch in vi/ro-Verfahren, wie z.B. das Template Generation System (Finnzymes, 
FIN-02201, Espoo, Finnland) oder das Transposomics-System (Epicentre 
Technologies, Biozym Diagnostika GmbH, Oldendorf, Deutschland) modifiziert 
werden. Solche, fur veranderte Biosynthesewege codierende Shuttle-Vektoren 
konnen dann in geeignete Wirtszellen ubertragen werden, urn zur Produktion ver- 
anderter Sekundannetabolite zu gelangen. In analoger Weise konnen die genannten 
Shuttle-Vektoren genutzt werden, die erfindungsgemaBen Nucleinsauren zu modifi- 
zieren, urn sie dann, nach Transfer in geeignete Wirtszellen zur Produktion ver- 
anderter Spinosyne einzusetzen. 

Teile der erfindungsgemaBen Nucleinsauren k6nnen auch als Bestandteil von zwei 
oder mehreren Vektoren, wie z.B. Cosmidvektoren, eine genetische Information 
determinieren, die in Kombination miteinander zur Biosynthese von Spinosyn oder 
Spinosyn-Vorstufen, wie z.B. Pseudoaglycon oder Spinosyn-Aglycon geeignet sind. 
Solche Kombinationen von rekombinanten Vektoren konnen eingesetzt werden, urn 
zu einer Spinosyn-Produktion in anderen Organismen als S. spinosa zu gelangen. 
Dies kann bei einer Expression in besonders geeigneten Wirten zu einer signifikanten 
Steigerung der Spinosyn-Produktion im Vergleich zu S. spinosa oder abgeleiteten 
produktionsverstSrkten Mutanten fuhren. Weiterhin ist es mdglich, erfindungsge- 
maBe Nucleinsauren in einzelnen rekombinanten Vektoren dieser Vektorkombination 
so zu verandern, dass eine heterologe Produktion von Spinosyn-Derivaten in Wirts- 
zellen mdglich ist. Desweiteren kann eine solche Kombination von rekombinanten 
Vektoren durch deren Transfer in heterologe Wirte zur Bildung neuer Spinosyn- 
Derivate unter Ausnutzung des wirtseigenen Enzymsystems geeignet sein. 

Gegenstand der vorliegenden Erfindung sind auch Wirtszellen, die zumindest eine 
der erfindungsgemaBen Nucleinsauren enthalten. Als Wirtszelle eignen sich sowohl 
prokaryotische Zellen, vorzugsweise Actinomyceten, besonders bevorzugt Strepto- 
myceten, als auch eukaryotische Zellen, wie Saugerzellen, Pflanzenzellen oder Hefe- 
zellen. 
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In besonderer Weise konnen die erfindungsgemaBen Nucleinsauren in pflanzliche 
Zellen iibertragen und exprimiert werden. Hierdurch konnen transgene Pflanzen her- 
gestellt werden, die das pflanzenschutzende, insektizide Spinosyn bzw. Derivate da- 
von produzieren. Eine Ubertragung der erfindungsgemaBen Nucleinsauren in die 
Pflanzenzellen oder pflanzliche Zellkulturen kann mit ublichen Verfahren u.a. auch 
durch Partikelbeschuss erfolgen. 

Gegenstand der vorliegendeh Erfindung sind weiterhin die Polypeptide, die von den 
erfindungsgemaBen Nucleinsauren codiert werden. Die erfindungsgemaBen Poly- 
peptide k6nnen ein vollstSndiges Enzym darstellen, das einen Schritt der Spinosyn- 
Biosynthese katalysiert. Jedoch sind auch solche Polypeptide von der Erfindung er- 
fasst, die nur einen Teil der vollstMndigen Aminosauresequenz des betreffenden 
Enzyms aufweisen. 

Der Ausdruck "Teilsequenz", wie er hierin verwendet wird, bezieht sich somit auf die 
Aminosauresequenz eines Polypeptids, das noch die Aktivitat des entsprechenden 
vollstandigen Enzyms oder einer enzymatisch aktiven DomSne ausiiben kann. 

Tm Folgenden werden bevorzugte erfindungsgemaBe Nucleinsauren und Polypeptide 
mit Bezug auf die entsprechenden SEQ ID NOS naher charakterisiert. 

SEQ ID NOS: 7 und 8, ORF1: 

Nucleotidpositon 828 bis 1 der SEQ ID NO: 4, 275 Aminosauren; 
das ableitbare Genprodukt ist eine Methyltransferase. 

SEQ ID NOS: 9 und 10, ORF2: 

Nucleotidposition 1.283 bis 2.455 der SEQ ID NO: 4, 390 Aminosauren; 
das ableitbare Genprodukt ist eine Glycosyltransferase. 
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SEQ ID NOS: 1 1 und 12, ORF3: 

Nucleotidposition 2.495 bis 3.247 der SEQ ID NO: 4, 250 Aminosauren; 
das ableitbareGenprodukt ist eine Methyltransferase. 

SEQ ID NOS: 13 und 14, ORF4: 

Nucleotidposition 4.440 bis 3.253 der SEQ ID NO: 4, 395 Aminosauren; 
das ableitbare Genprodukt ist eine Methyltransferase. 

SEQ ID NOS: 15 und 16, ORF5: 

Nucleotidposition 4.578 bis 6.197 der SEQ ID NO: 4, 539 Aminosauren; 

das ableitbare Genprodukt ist ein C-C verknupfendes Enzym, das Cyclisierungs- 

reaktionen durchfiihrt. 

SEQ ID NOS: 17 und 18, ORF6: 

Nucleotidposition 6.21 1 bis 7.404 der SEQ ID NO: 4, 397 Aminosauren; 
das ableitbare Genprodukt ist eine Methyltransferase. 

SEQ ID NOS: 19 und 20, ORF7: 

Nucleotidposition 7.401 bis 8.300 der SEQ ID NO: 4, 299 Aminosauren; 
das ableitbare Genprodukt ist eine Methyltransferase. 

SEQ ID NOS: 21 und 22, ORF8: 

Nucleotidposition 8.300 bis 9.466 der SEQ ID NO: 4, 388 Aminosauren; 

das ableitbare Genprodukt ist ein Enzym, das an Cyclisierungsreaktionen beteiligt ist. 

SEQ ID NOS: 23 und 24, ORF9: 

Nucleotidposition 10.572 bis 9.562 der SEQ ID NO: 4, 336 Aminosauren. 
das ableitbare Genprodukt ist eine 2,3-Reduktase. 
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SEQ ID NOS: 25 und 26, ORF10: 

Nucleotidposition 12.029 bis 10.569 der SEQ ID NO: 4, 486 Aminosauren; 
das ableitbare Genprodukt ist eine 2,3-Dehydratase. 

5 SEQ ID NOS: 27 und 28, ORF1 1 : 

Nucleotidposition 12.549 bis 12.109 der SEQ ID NO: 4, 146 Aminosauren; das 
ableitbare Genprodukt hat Homologien zu einer Thioesterase. 

SEP ID NOS: 29 und 30, ORF12: 
10 Nucleotidposition 13.865 bis 12.546 der SEQ ID NO: 4, 439 Aminosauren; 
das ableitbare Genprodukt ist eine Glykosyltransferase. 

SEP ID NOS: 31 und 32, ORF13: 

Nucleotidposition 14.245 bis 15.633 der SEQ ID NO: 4, 462 Aminosauren; 
1 5 das ableitbare Genprodukt ist eine 3,4-Dehydratase. 

SEQ ID NOS: 33 und 34, ORF14: 

Nucleotidposition 15.671 bis 16828 der SEQ ID NO: 4, 385 Aminosauren; 
das ableitbare Genprodukt ist eine 4-Aminotransferase. 

20 

SEQ ID NO: 35 und 36, ORF15: 

Nucleotidposition 16.831 bis 17.580 der SEQ ID NO: 4, 249 Aminosauren; 
das ableitbare Genprodukt ist eine N-Dimethyltransferase. 

25 SEQ ID NOS: 37 und 38, ORF16: 

Nucleotidposition 1 8.930 bis 1 8.205 der SEQ ID NO: 4, 241 Aminosauren; 
das ableitbare Genprodukt ist eine 3,4-Reduktase. 

SEQ ID NOS: 39 und 40, ORF17: 
30 Nucleotidposition 19.025-19.861 der SEQ ID NO: 4, 278 Aminosauren; 
das ableitbare Genprodukt ist ein Transkriptions-Regulator. 
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SEQ ID NOS: 41 unci 42, ORF18: 

Nucleotidpositionen 1 16-7903 der SEQ ID NO: 5, Aminosaurepositionen 1 bis 
2595: 

5 Nucleotidpositionen 128-1402, Aminosaure Positionen 5-429 codieren eine P- 
Ketoacyl:Acyl Carrier Protein Synthase-Domane; 

Nucleotidpositionen 1691-2656, Aminosaurepositionen 526-847 codieren eine 
Acyltransferase-Domane; 

Nucleotidpositionen 2798-3052, Aminosaurepositionen 895-979 codieren eine P- 
1 0 Ketoacyl: Acyl Carrier Protein-Domane; 

Nucleotidpositionen 3107-4372, Aminosaurepositionen 998-1419 codieren eine P- 
Ketoacyl:Acyl Carrier Protein Synthase-Domane; 

Nucleotidpositionen 4688-5662, Aminosaurepositionen 1525-1849 codieren eine 
Acyltransferase-Domane; 
15 Nucleotidpositionen 6587-7138, Aminosaurepositionen 2158-2341 codieren eine 
Ketoreduktase-Domane; 

Nucleotidpositionen 7409-7666, Aminosaurepositionen 2432-2517 codieren eine P- 
Ketoacyl: Acyl Carrier Protein-Domane. 

20 SEQ ID NOS: 43 und 44, ORF19: 

Nucleotidpositionen 7921-14379 der SEQ ID NO: 5, Aminosaurepositionen 1 bis 
2152: 

Nucleotidpositionen 8029-9318, Aminosaurepositionen 37-466 codieren eine p- 
Ketoacyl:Acyl Carrier Protein Synthase-Domane; 
25 Nucleotidpositionen 9634-10608, Aminosaurepositionen 572-896 codieren eine 
Acyltransferase-Domane; 

Nucleotidpositionen 10705-11259, Aminosaurepositionen 929-1113 codieren eine 
Dehydratase-Domane; 

Nucleotidpositionen 12043-13080, Aminosaurepositionen 1375-1720 codieren eine 
30 Enoylreduktase-Domane; 
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Nucleotidpositionen 13093-13635, Aminosaurepositionen 1725-1905 codieren eine 
Ketoreduktase-Domane; 

Nucleotidpositionen 13885-14142, Aminosaurepositionen 1989-2074 codieren eine 
p-Ketoacyl:Acyl Carrier Protein-DomSne. 

5 

SEQ ID NOS: 45 und 46, ORF20: 

Nucleotidpositionen 14424-23936 der SEQ ID NO: 5, Aminosaurepositionen 1 bis 
3170: 

Nucleotidpositionen 14523-15824, Aminosaurepositionen 34-467 codieren eine p- 
1 0 Ketoacyl: Acyl Carrier Protein Synthase-Domane; 

Nucleotidpositionen 16110-17075, Aminosaurepositionen 563-884 codieren eine 
Acyltransferase-Domane; 

Nucleotidpositionen 17997-18536, Aminosaurepositionen 1192-1371 codieren eine 
Ketoreduktase-Domane; 
15 Nucleotidpositionen 18795-19052, Aminosaurepositionen 1458-1543 codieren eine 
P-Ketoacyl: Acyl Carrier Protein-Domane; 

Nucleotidpositionen 19107-20387, Aminosaurepositionen 1562-1988 codieren eine 
P-Ketoacyl:Acyl Carrier Protein Synthase-Domane; 

Nucleotidpositionen 20718-21692, Aminosaurepositionen 2099-2423 codieren eine 
20 Acyltransferase-Domane; 

Nucleotidpositionen 22620-23171, Aminosaurepositionen 2733-2916 codieren eine 
Ketoreduktase-Domane; 

Nucleotidpositionen 23436-23693, Aminosaurepositionen 3005-3090 codieren eine 
p-Ketoacyl:Acyl Carrier Protein-Domane. 

25 

SEP ID NOS: 47 und 48, ORF21 : 

Nucleotidpositionen 23983-38757 der SEQ ID NO: 5, Aminosaurepositionen 1 bis 
4924: 

Nucleotidpositionen 24082-25392, Aminosaurepositionen 34-470 codieren eine p- 
30 Ketoacyl: Acyl Carrier Protein Synthase-Domane; 
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Nucleotidpositionen 25696-26661, Aminosaurepositionen 572-893 codieren eine 
Acyltransferase-Doraane; 

Nucleotidpositionen 26761-27315, Aminosaurepositionen 927-1111 codieren eine 
Dehydratase-Domane; 

5 Nucleotidpositionen 28231-28782, Aminosaurepositionen 1417-1600 codieren eine 
Ketoreduktase-Domane; 

Nucleotidpositionen 29035-29265, Aminosaurepositionen 1685-1761 codieren eine 
P-Ketoacyl:Acyl Carrier Protein-DomSne; 

Nucleotidpositionen 29329-30624, Aminosaurepositionen 1783-2214 codieren eine 
1 0 P-Ketoacyl: Acyl Carrier Protein Synthase-Domane; 

Nucleotidpositionen 30928-31902, Aminosaurepositionen 2316-2640 codieren eine 
Acyltransferase-Domane; 

Nucleotidpositionen 32827-33378, Aminosaurepositionen 2949-3132 codieren eine 
Ketoreduktase-Domane; 
15 Nucleotidpositionen 33652-33900, Aminosaurepositionen 3224-3306 codieren eine 
P-Ketoacyl:Acyl Carrier Protein-Domane; 

Nucleotidpositionen 33952-35262, Aminosaurepositionen 3324-3760 codieren eine 
p-Ketoacyl:Acyl Carrier Protein Synthase-Domane; 

Nucleotidpositionen 35554-36522, Aminosaurepositionen 3858-4180 codieren eine 
20 Acyltransferase-Domane; 

Nucleotidpositionen 37453-37998, Aminosaurepositionen 4491-4672 codieren eine 
Ketoreduktase-Domane; 

Nucleotidpositionen 38254-38511, Aminosaurepositionen 4758-4843 codieren eine 
P-Ketoacyl: Acyl Carrier Protein-Domane. 

25 

SEQ ID NOS: 49 und 50, ORF22: 

Nucleotidpositionen 38808-50000 der SEQ ID NO: 5 und die Nukleotidpositionen 1 
bis 5574 der SEQ ID NO: 6, Aminosaurepositionen 1 bis 5588: 
Nucleotidpositionen 38907-40226 der SEQ ID NO: 5, Aminosaurepositionen 34-473 
30 codiert eine p-Ketoacyl:Acyl Carrier Protein Synthase-Domane; 
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Nucleotidpositionen 40494-41453 der SEQ ID NO: 5, Aminosaurepositionen 563- 
882 codieren eine Acyltransferase-Domane; 

Nucleotidpositionen 41556-42119 der SEQ ID NO: 5, Aminosaurepositionen 917- 
1 104 codieren eine Dehydratase-Domane; 
5 Nucleotidpositionen 43017-43568 der SEQ ED NO: 5, Aminosaurepositionen 1404- 
1587 codieren eine Ketoreduktase-Domane; 

Nucleotidpositionen 43833-44090 der SEQ ID NO: 5, Aminosaurepositionen 1676- 
1761 codieren eine P-Ketoacyl:Acyl-Carrier Protein Domane; 
Nucleotidpositionen 44151-45473 der SEQ ID NO: 5, Aminosaurepositionen 1782- 
10 2222 codieren eine p-Ketoacyl:Acyl Carrier Protein Synthase-Domane; 

Nucleotidpositionen 45765-46730 der SEQ ID NO: 5, Aminosaurepositionen 2320- 
2641 codieren eine Acyltransferase-Domane; 

Nucleotidpositionen 46827-47459 der SEQ ID NO: 5, Aminosaurepositionen 2674- 
2884 codieren eine Dehydratase-Domane; 
15 Nucleotidpositionen 48378-48935 der SEQ ID NO: 5, Aminosaurepositionen 3191- 
3376 codieren eine Ketoreduktase-Domane; 

Nucleotidpositionen 49182-49412 der SEQ ID NO: 5, Aminosaurepositionen 3459- 
3535 codieren eine P-Ketoacyl:Acyl Carrier Protein-Domane; 
Nucleotidpositionen 49482-50000 der SEQ ID NO: 5 und Nucleotidposition 1 bis 
20 759 der SEQ ID NO: 6, Aminosaurepositionen 3559-3984 codieren eine 0- 
Ketoacyl: Acyl Carrier Protein Synthase-Domane; 

Nucleotidpositionen 1084-2049 der SEQ ID NO: 6, Aminosaurepositionen 4093- 
4414 codieren eine Acyltransferase-Domane; 

Nucleotidpositionen 2146-2697 der SEQ ID NO: 6, Aminosaurepositionen 4447- 
25 4630 codieren eine Dehydratase-Domane; 

Nucleotidpositionen 3604-4155 der SEQ ID NO: 6, Aminosaurepositionen 4933- 
51 16 codieren eine Ketoreduktase-Domane; 

Nucleotidpositionen 4420-4677 der SEQ ID NO: 6, Aminosaurepositionen 5205- 
5290 codieren eine P-Ketoacyl: Acyl Carrier Protein-Domane; 
30 Nucleotidpositionen 4864-5538 der SEQ ID NO: 6, Aminosaurepositionen 5353- 
5577 codieren eine Thioesterase-Domane. 
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SEQ ID NOS: 52 und S3, ORF23: 

Nucleotidposition 344 bis 1333 der SEQ ID NO: 51, 329 Aminosauren; 
das ableitbare Genprodukt ist eine dNDP-Glucose-4,6-Dehydratase. 

5 

SEQ ID NOS: 54 und 55, ORF24: 

Nucleotidposition 1330 bis 2247 der SEQ ED NO: 51, 305 Aminosauren; 

das ableitbare Genprodukt ist eine dNDP-4-keto-6-Deoxyglucose-3,5-Epimerase. 

10 Die an der Cyclisierung des 5, 6, 5- Tricyclus beteiligten Produkte des ORF 5 (SEQ 
ID NO: 16) und des ORF 8 (SEQ ID NO: 22) sind aufgrund der ungewohnlichen 
Cyclisierungsreaktionen von besonderem Interesse. Daher beinhaltet die vorliegende 
Erfindung insbesondere auch homologe Nucleinsauren oder homologe Genprodukte. 
Vorzugsweise zeigen diese homologen Genprodukte mindestens eine 50%ige, 

15 bevorzugt eine 60 %ige und besonders bevorzugt eine 70 %ige Identitat auf Amino- 
saureebene. 

Weiterhin sind Antikorper Gegenstand der Erfindung, die spezifisch an die 
yorstehend genannten Polypeptide binden. Die Herstellung solcher Antikorper erfolgt 
20 auf die tibliche Weise. Diese Antikorper konnen genutzt werden, um Expres- 
sionsklone z.B. einer Genbank zu identifizieren, die die erfindungsgemaBen Nuclein- 
sauren tragen. 

Gegenstand der yorliegenden Erfindung sind auch Verfahren zum Herstellen der 
25 erfindungsgemaBen Nucleinsauren. Die erfindungsgemaBen Nucleinsauren kflnnen 
auf die tibliche Weise hergestellt werden. Beispielsweise konnen die Nucleinsaure- 
molekule vollstSndig chemisch synthetisiert werden. Man kann auch kurze Stucke 
der erfindungsgemaBen Nucleinsauren chemisch synthetisieren und solche 
Oligonucleotide radioaktiv oder mit einem Fluoreszenzfarbstoff markieren. Die 
30 markierten Oligonucleotide konnen auch verwendet werden, um Genbanken von 
Organismen zu durchsuchen. Klone, an die die markierten Oligonucleotide hybridi- 
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sieren, werden zur Isolierung der betreffenden DNA ausgewahlt. Nach der Charak- 
terisierung der isolierten DNA erhSlt man auf einfache Weise die erfindungsgemaBen 
Nucleinsauren. Die erfindungsgemaBen Nucleinsauren konnen auch mittels PCR- 
Verfahren unter Verwendung chemisch synthetischer Oligonucleotide hergestellt 
5 werden. 

Gegenstand der vorliegenden Erfindung sind weiterhin Verfahren zum Herstellen der 
erfindungsgemaBen Polypeptide. Zur Herstellung der Polypeptide, die von den erfin- 
dungsgemaBen Nucleins&uren codiert werden, kfinnen WirtS2ellen, die zumindest 
10 eine der erfindungsgemaBen Nucleinsauren enthalten, unter geeigneten Bedingungen 
kultiviert werden. Die gewQnschten Polypeptide ktfnnen danach auf iibliche Weise 
aus den Zellen oder dem Kulturmedium isoliert werden. Die Polypeptide k6nnen 
auch in in v/Tro-Systemen hergestellt werden. 

15 Das isolierte und charakterisierte Gencluster und benachbarte oder assoziierte DNA- 
Regionen stellen ein Target zur Steigerung der Spinosyn-Biosynthese durch 
genetische Manipulation, Uber- oder Unterexpression von direkt oder indirekt an der 
Biosynthese involvierten Genen oder regulatorischen Sequenzen dar. Diese Mani- 
pulationen kSnnen sowohl in naturlichen Spinosyn-produzierenden Organismen als 

20 auch in gentechnisch hergestellten Spinosyn-produzierenden Organismen durchge- 
fiihrt werden. Beispielsweise konnen ausgewahlte ORF's unter die Kontrolle flblicher 
starker Promotoren wie dem me/-Promotor des Plasmides pIJ702 (John Innes 
Foundation, Norwich, UK, 1985) gestellt werden. 

25 Durch die Klonierung und Identifizierung der Spinosyn-Biosynthesegene schafft die 
vorliegende Erfindung die genetische Basis, mittels molekulargenetischer Verfahren 
neue Spinosyn-Vorstufen und -Derivate herzustellen. 

Der Ausdruck "Spinosyn-Vorstufen", wie er hierin verwendet wird, bezieht sich auf 
30 alle nachweisbaren und alle postulierbaren Biosynthesevorstufen von Spinosyn. 
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Der Ausdruck "Spinosyn-Derivate", wie er hierin verwendet wird, bezieht sich auf 
Strukturderivate aller bisher bekannten Spinosyne. 

Gegenstand der vorliegenden Erfindung ist somit auch ein Verfahren zum Herstellen 
5 von Spinosyn-Vorstufen und -Derivaten. 

Die erfindungsgemafien Nucleinsauren konnen beispielsweise eingesetzt werden, um 
durch kombinatorische Biosynthese neue Spinosyn-Derivate mit Veranderungen des 
Spinosyn-Aglycons herzustellen. Dies kann z.B. dadurch erreicht werden, dass die 

10 von ORF 19 codierte, eine Acetat-Einheit einbauende Acyltransferase-Domane aus- 
getauscht wird gegen eine Acyltransferase-Domane, die eine Propionat-Einheit ein- 
baut. In gleicher Weise kann die, eine Acetat-Einheit einbauende Acyltransferase- 
Domane des ORF 18 gegen eine Acyltransferase-Domane ausgetauscht werden, die 
eine Propionat-Einheit einbaut Femer ist es moglich beide oder jeweils eine 

15 Ketoreduktase-Domane, die von beiden genannten ORF's codiert werden zu in- 
aktivieren, durch eine inaktive Ketoreduktase-Domane zu ersetzen oder zu deletieren, 
wodurch eine Hydroxygruppe an der entsprechenden Position im Makrocyclus bio- 
synthetisch hergestellt werden kann. Alle Acyltransferase-, Ketoreduktase-, 
Dehydratase-, Enoylreduktase-, P-Ketoacyl:Acyl Carrier Protein und Thioesterase- 

20 Domanen k6nnen einzeln oder in beliebiger Kombination durch entsprechende Poly- 
ketidsynthase-Domanen mit anderer Substrat- oder Reaktionsspezifitat ersetzt 
werden, in beliebiger Kombination miteinander fiisioniert, einzeln oder in beliebiger 
Kombination mutagenisiert, deletiert oder dupliziert werden. Ferner kSnnen Modul- 
codierende Sequenzen ausgetauscht werden. So ist es denkbar die Modul 2- 

25 codierende DNA-Sequenz (Abb. 6) gegen die Modul 1- oder Modul 3, 4, 5, 6, 7, 8- 
oder Modul 9-codierende DNA-Sequenz (Abb. 6) zu ersetzen und funktionell zu 
exprimieren. Es ist auch denkbar die Modul 2-codierende DNA-Sequenz oder jede 
andere Modul-codierende DNA-Sequenz des Spmosyn-Polyketidsynthase-Gen- 
clusters gegen eine andere Modul-codierende DNA-Sequenz des Spinosyn-Poly- 

30 ketidsynthase-Genclusters, die eine andere biosynthetische Verlangerungseinheit ein- 
baut, auszutauschen. Dariiber hinaus kann jede andere Modul-codierende DNA- 
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Sequenz des Spinosyn-Polyketidsynthase-Genclusters gegen eine andere Modul- 
codierende DNA-Sequenz einer anderen Polyketidsynthase-NukleinsSuresequenz aus 
S. spinosa oder einem anderen Organismus als S. spinosa, wie z.B. 
Saccharopolyspora erythraea, ausgetauscht werden. Diese VerMnderungen konnen 
5 unter Ausnutzung der ET-Rekombination (WO 99/29837; Muyrers et al., 1999) oder 
anderer Klonierungs- und Rekombinationstechniken durchgefiihrt werden. 

Gegenstand der Erfindung sind somit auch alle Modul- oder Domanen-codierenden 
Nucleinsauren, die nattirlicher oder gentechnisch erzeugter Bestandteil der Spinosyn- 
10 Polyketidsynthase sind. 

Der Ausdruck "Modul", sowie er hierin verwendet wird, bedeutet, dass eine Anord- 
nung von drei enzymkatalytisch aktiven DomSnen vorliegt, die zu eine Verlangerung 
der wachsenden Polyketidkette um eine biosynthetische VerlSngerungseinheit 

15 fuhren. Bei diesen DomSnen handelt es sich um eine p~Ketoacyl:Acyl Carrier Protein 
Synthase-Domane, eine Acyltransferase-Domane und eine P-Ketoacyl:Acyl-Carrier 
Protein-DomSne. Ein Modul kann auch eine Ketoreduktase-, eine Dehydratase-, eine 
Enoylreduktase- und eine Thioesterase-Domane tragen. Ein sog. Ladungsmodul, das 
am Beginn der Biosynthese steht kann von den genannten Domanen lediglich eine 

20 Acyltransferase-Domane und eine p-Ketoacyl:Acyl-Carrier Protein-Domane tragen, 
sowie eine enzymatisch inaktive p-Ketoacyl:Acyl Carrier Protein Synthase-Domane. 
Eine Polyketidsynthase-Domane umfasst jeweils eine dieser genannten enzyma- 
tischen Aktivitaten. 

25 Die erfindungsgemaBen Nucleinsauren konnen weiterhin genutzt werden, um im 
Zuge einer kombinatorischen Biosynthese durch Neuanordnung und Expression von 
Spinosyn-Polyketidsynthase-Nucleinsauresequenzen oder durch {Combination und 
Expression zusammen mit Polyketidsynthase-Nucleinsauresequenzen einer anderen 
Polyketidsynthase codierenden Nucleinsauresequenz aus S. spinosa oder einem 

30 anderen Organismus, wie z.B. Saccharopolyspora erythaea, Bibliotheken von re- 
kombinanten Polyketidsynthase-Nucleinsauresequenzen, rekombinanten Polyketid- 
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synthase-Proteinen oder rekombinant erzeugten Polyketiden herzustellea Diese 
Polyketide konnen durch die Verwendung der erfindungsgemaBen Nucleinsauren 
oder die Verwendung anderer Nucleinsauren, deren ableitbaren Produkte an der Bio- 
synthese anderer Zucker und Ankopplung ans Aglycon beteiligt sind, glycosyliert 
5 werden. Es ist bekannt, dass die Glycosylierung des Aglycons eine entscheidende 
Rolle bei der biologischen Aktivtat am Wirkort spielt. Diese VerSnderungen konnen 
sowohl in naturlichen als auch in gentechnisch hergestellten Spinosyn-produzie- 
renden Organismen, insbesonders Bakterien, erfolgen. Weiterhin k6nnen diese Ver- 
anderungen unter Ausnutzung der ET-Rekombination (WO 99/29837; Muyrers et al., 
10 1999) oder anderer Klonierungs- und Rekombinationstechniken durchgefiihrt 
werden. 

Die erfindungsgemaBen Nucleinsauren, Vektoren und regulatorischen oder genetisch 
mobilen Regionen kcmnen auBerdem zum Auffinden von Genen verwendet werden, 
15 die fur Polypeptide codieren, welche fiinktionell ahnliche Polyketidsynthasen oder 
fiinktionell ahnliche Produkte, die an einer Zuckerbiosynthese beteiligt sind, 
codieren. 

Da die erfindungsgemSBen Nucleinsauren einen umfangreichen Teil des Genoms von 
20 S. spinosa ausmacht, k6nnen die erfindungsgemaBen Nucleinsauren als Marker bei 
der Sequenzierung des Genoms von S. spinosa eingesetzt werden, wodurch die An- 
ordnung von Teilsequenzen eines Genomsequenzierungsprojektes erhebhch er- 
leichtert wird. 

25 Somit liefem die erfindungsgemaBen Nucleinsauren Daten, die im Rahmen eines 
Genomsequenzierungsprojektes und eines sich darauf aufbauenden Metabohc 
Engineering zur Steigerung der Spinosynproduktion eingesetzt werden kdnnen. 
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Erlauterungen zu den Abbildungen: 

Abbildung 1 : Modell fiir die Biosynthese der Spinosyn-Zucker D-Forosamin und 2, 
3, 4-Tri-O-Methyl-L-Rhamnose. 

5 

Abbildung 2: Lage, der an der Spinosyn-Biosynthese direkt oder indirekt beteiligten 
DNA-Regionen 1 (SEQ ID NO: 4) und DNA-Region 2 (SEQ ID NOS: 5 und 6). Die 
schwarzen Balken im unteren Teil der Abbildung geben schematisch die Positonen 
der Cosmid-DNA Inserts zueinander und in Bezug zu den DNA-Regionen 1 und 2 
10 an. Die dargestellen Cosmid-Inserts wurden zur Sequenzierung der SEQ ID NOS: 1 
bis 3 herangezogen. 

Abbildung 3: Schematische Darstellung der Lage der Insert-DNA (schwarze Balken 
im unteren Teil der Abbildung) der benannten Cosmide, die zur Ankopplung eines 
15 Forosamin-Restes oder eines Trimethyl-Rhamnose-Restes durch Biotransformation 
des Spinosyn-Aglycons und Spinosyn-Pseudoaglycons herangezogen worden sind. 

Abbildung 4: Schematische Darstellung der offenen Leserahmen (ORF's) der DNA- 
Region 3, die der SEQ ID NO: 5 1 entspricht, auf Cosmid 16-2-2. 

20 

Abbildung 5: Schematische Darstellung offener Leserahmen (ORF's) der DNA- 
Regionen 1 und 2. Die ORF's sind nummeriert von 1 bis 22 entsprechend SEQ ID 
NOS: 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47 
und 49. 

25 

Abbildung 6: Schematische Darstellung offener Leserahmen (ORF's) der DNA- 
Region 2 (SEQ ID NOS: 5 und 6) und ableitbarer Module und DomSnen. SM, 
Startmodul; Ml bis M10, Modul 1 bis Modul 10; KS, p-Ketoacyl:Acyl Carrier 
Protein Synthase; AT, Acyltransferase; ACP, p-Ketoacyl:Acyl Carrier Protein; KR, 
30 Ketoreduktase; DH, Dehydratase; ER, Enoylreduktase; TE, Thioesterase. 
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Abbildung 7: Schematische Darstellung der Lage von BAC-Shuttleklon Insert-DNA 
als schwarze Balken im unteren Teil der Abbildung. Die GroBe der Insert-DNA 
betragt mindestens 100 kb. Durchgezogene Balken: DNA-Sequenz ist identisch mit 
Teilen der DNA-Region 1 und der Gesamtheit der DNA-Region 2 (P11/G6 und 
5 PI 1/B10) bzw. mit der Gesamtheit der DNA-Region 1 und Teilen der DNA-Region 
2 (P8/G11). Gestrichelte Balken: DNA-Sequenz liegt auBerhalb des sequenzierten 
Bereichs. 
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Beispiele 

Bakterienstamme und Plasmide 

Escherichia coli XLl-Blue MRF' und die Cosmidvektoren SuperCosl (Stratagene, 
5 Europe) und pOJ446 (Biermann et al., 1992) wurden verwendet zur Erstellung von 
Genbanken von S. spinosa ATCC49460 (American Type Culture Collection, U.S.A., 
EP-A 0 375 316). E. coli JM110 (Stratagene, Europe) wurde verwendet zur Propa- 
gierung von Plasmiden, die durch Transformation nach Streptomyces ubertragen 
wurden. Streptomyces albus J1074 (Chater and Wilde, 1980; John Innes Institut in 
10 Norwich, UK) wurde zur heterologen Expression von und zur Biotransformation mit 
Spinosyn-Biosynthesegenen eingesetzt. 

Plasmid pBeloBACl 1 (Kim et al., 1996) und pOJ446 (Biermann et al., 1992) wurden 
verwendet zur Herstellung eines E. coli - Streptomyces BAC-Shuttlevektors. 

15 

Molekularbiologische Methoden 

Molekularbiologische Methoden wie DNA-Restriktion, Agarose-Gelelektrophorese 
von DNA, Ligation von Restriktionsfiragmenten, Kultivierung und Transformation 
20 von & coli wurden durchgefuhrt wie beschrieben in Sambrook et. al (1989). 
Plasmide wurden mit Qiagen Plasmid Kit (Qiagen, Hilden, Deutschland) isoliert. Die 
verwendeten Enzyme stammten von Roche Diagnostics GmbH (Mannheim, 
Deutschland). 

25 Anzucht Bedingungen and molekulargenetische Methoden mit S. spinosa und 
Streptomyceten sind beschrieben in (Hopwood et al., 1985). AUe Anzuchten in 
Flussigkultur von S. spinosa oder Streptomyceten erfolgten aerob in Erlenmeyer- 
kolbenbei 28°C. 
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Die DNA-DNA-Hybridisierungen erfolgten unter Verwendung des DIG-High Prime 
DNA Labeling and Detection Kit nach Angaben des Herstellers (Roche Diagnostics 
GmbH, Mannheim, Deutschland). 

5 Wachstumsmedien: 

LB Sambrook et. al, 1989 

TS Difco Bestell-Nummer 0 370-17-3 (Difco Detroit, MI, USA) 

10 

R5A Illingetal.,1985 

Herstellung einer Cosmid Genbank von S. spinosa 

15 Um eine Genbank von S. spinosa zu erhalten, wurde chromosomale DNA von S. 
spinosa ATCC49460 mit Mbol partiell geschnitten und durch Zentrifugation im 
Glucosedichtegradienten aufgetrennt. Die Cosmid-DNA (SuperCosl, Stratagene 
Europe) wurde nach Angaben des Herstellers vorbereitet, mit den & spinosa DNA- 
Fragmenten zwischen 35 und 45 kb ligiert und mit Hilfe des Gigapack-Verpackungs- 

20 system (Stratagene Europe) in Phagenpartikel verpackt. Die Transfektion erfolgte in 
E. coli XL-1 blue MRF\ Diese Methode wurde ebenfalls dazu verwendet eine zweite 
S. spinosa Genbank anzulegen unter Verwendung des R colUStreptomyces Shuttle- 
Cosmids pOJ446. 

25 Sequenzierung des Spinosynbibsynthese-Genclusters und eines DNA-Fragmentes 
das auBerhalb dieses Clusters liegt, dessen Produkte aber an der Biosynthese von 
Spinosyn beteiligt sind 



30 



Die Insert-DNA der SuperCosl Cosmide 16-1-8, 16-59-1, und 16-59-8 wurden 
sequenziert Eine ca. 4 kb groBe Lucke zwischen den Cosmiden 16-59-1 und 16-1-8 
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wurde durch pimer walking Sequenzierung eines entsprechenden Teilbereiches von 
Cosmid 16-59-6 geschlossen. 

Eine ca. 2,3 kb groBe DNA-Sequenz auf dem SuperCosl Cosmid 16-2-2 wurde 
5 sequenziert 

Identifizierung und Charakterisierung von chromosomalen DNA-Fragmenten einer 
BAC-Shuttlevektor-Genbank aus S. spinosa. die Spinosvn-Biosvnthesegensequenzen 
tragen 

10 

Zur Herstellung des BAC-Shuttlevektors, der nicht nur in E. coli sondern auch in 
Actinomyceten wie Streptomyces iibertrageh und vermehrt werden kann, wurde der 
Vektor pBeloBACll mit Xhol linearisiert, und durch die Anwendung von Klenow 
Polymerase wurden glatte DNA-Enden hergestellt. Ein ca. 6 kb groBes Dral - 
15 EcoRV DNA-Fragment des Cosmidvektors pOJ446, das den Replikationsursprung 
des Plasmides SCP2*, das Apramycinresistenzgen sowie den oriT zum konjugativen 
Transfer tragt, wurde mit dem linearisierten BAC- Vektor ligiert. Der resultierende 
Vektor erhielt die Bezeichnung pEBZ333. 

20 Ausgehend von partiell mit Mbol geschnittener genomischer DNA des Stammes S. 
spinosa ATCC49460 und dem mit BamUl geschnittenen Vektor pEBZ333 wurde 
eine BAC-Genbank erstellt. 

Analyse und Annotation offener Leserahmen direkt oder indirekt an der Spinosyn- 
25 Biosynthese beteiligter DNA-Sequenzen 

Ausgehend von der Sequenz gemaB SEQ ID NOS: 1 bis 3 wurden offene Lese- 
rahmen (ORF's) identifiziert, die direkt oder indirekt an der Biosynthese von 
Spinosyn beteiligt sind. Diese ORF's wurden in zwei DNA-Regionen unterteilt, die 
30 als DNA-Region 1 und DNA-Region 2 (Abb. 2 und 5) bezeichnet werden und die 
Sequenzen gemaB SEQ ID NO: 4 bzw. 5 und 6 tragen. Die DNA-Region 1 tragt 
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offehe Leserahmen, deren Produkte an der Modifiziemng und Tricyclusbildung des 
Spinosyn-Aglycons beteiligt sind, wahrend die DNA-Region 2 (Abb. 2, 5 und 6) 
offene Leserahmen umfasst, deren Produkte die Spinosyn-Polyketidsynthase 
codieren. Die beiden jeweils ersten Nucleotide dieser DNA-Regionen liegen un- 
5 mittelbar nebeneinander (Abb. 2, 3 und 5). 

Eine weitere DNA-Region 3 (SEQ ID NO: 51) liegt auBerhalb dieses Clusters von 
DNA-Sequenzen und tragt offene Leserahmen, deren Produkte ebenfalls an der Bio- 
synthese des Spinosyn-Zuckers Trimethyl-Rhamnose beteiligt sind. 

10 

Herstellung des Spinosyn-Aglycons und 17-Pseudoaglycons aus Tracer® 

Ausgehend von 18,7 g des kommerziell erhaltlichen Produktes Tracer® wurden nach 
Gefriertrocknung und Saulenchromatographie an Kieselgel 8,92 g Spinosyn A und D 
1 5 in einem Verhaitnis von 82: 1 8 gewonnen. 

Die Hydrolyse des Aminozuckers Forosamin gelang mit 2.7 N Schwefelsaure in 
Ethanol unter Rtickflufl. Dabei fiel der GroBteil des entstehenden 17-Pseudoaglycons 
von Spinosyn AID aus. Im Filtrat wurden neben weiterem 17-Pseudoaglycon in Ab- 
hSngigkeit von der Reaktionsdauer geringe bis mittlere Mengen des Spinosyn- 
Aglycons gefiinden. 

Eine vollstandige Hydrolyse zum Aglycon gelang unter etwas drastischeren Be- 
dingungen (7.2 N Schwefelsaure in Methanol unter RuckfluB). Die Aglycon-Fraktion 
enthielt ausschlieBlich Aglycon von Spinosyn A. Dies stimmt sehr gut mit der 
Literatur ttberein (Creemer et al., 1998X die unter entsprechenden Reaktionsbe- 
dingungen eine vollstandige Zersetzimg des Pseudoaglycons von Spinosyn D be- 
schreibt Als Ursache nehmen die Autoren eine leichtere Protonierung der 5,6- 
Doppelbindung bei Spinosyn D unter Bildung eines tertiaren Carbokations und an- 
schlieBende Umlagerungen an. 
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Es konnten somit ausgehend von 18,7 g kauflichem Tracer® 3,0 g Aglycon von 
Spinosyn A hergestellt werden. 



Tracer (Dow AgroScience) 
18.7g 



i 



Gefriertrocknung 



grauer Feststoff 
10.0 g 



1" 



I 



Chromatographie 




Spinosyn MO (A. R = H, 82 %; D, R = CH 3 , 18 %) 
8.92 g 



i 



69% 



OH 




MeO Me 



2.7NH2S0 4 /EtOH 
3 h ROckflua 



OH 



16% 




17-Pseudoaglycon von Spinosyn A/D 

(A, R = H, 82 %; D, R = CH 3 , 18 %) 



Aglycon von Spinosyn A 



64% 



1 



7.2NH 2 S0 4 /MeOH 
3 h RQckfluR 
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Gewinnung von Spinosyn A/D aus Tracer 

Die Gefriertrocknung von 18,7 g Tracer® lieferten 10,0 g grauen Feststoff. Nach 
Saulenchromatographie dieses Feststoffes an 800 cm 3 Kieselgel (Eluent: Dichlor- 
5 methan/Methanol 95:5) erhielt man 8,92 g reines Spinosyn A/D (82 % A, 18 % D). 
-DC: R f (Si0 2 , Dichlonnethan/Methanol 9:1) = 0,46. - l H-NMR: CDC1 3 , 5 = 6,77 
(s, 13-H); 5,88 (d, 5-H von Spinosyn A); 5,80 (m, 6-H von Spinosyn A); 5,49 (m, 
5-H von Spinosyn D); 4,87 (d, l'-H); 4,67 (m, 21-H); 4,43 (d, 1"-H); 4,31 (m, 9-H) 
u. a. - LC/MS: Elektrospray Positiv; Peak bei RT 44,0 min: m/z = 733 (100 %) 
10 [M+H] + (Spinosyn A); Peak bei 44,7 min: m/z - 747 (100 %) [M+H]+ (Spinosyn D). 

Darstellung des 17-Pseudoaglycons von Spinosyn A/D: 

8,65 g (1 1,81 mmol) Spinosyn A/D wurden in 61 ml Ethanol gel6st und mit 104 ml 

15 Wasser und 208 ml 4 N H2SO4 versetzt. Nach 3 h Erwarmen unter RiickfluB wurde 
der ausgefallene Feststoff (A) abfiltriert und getrennt vom Filtrat (B) aufgearbeitet 
Der Feststoff (A) wurde mit 1 N H2SO4 gewaschen, in 140 ml Dichlormethan aufge- 
nommen, nacheinander mit gesatt. NaHC03-L6sung und gesatt. NaCl-Losung ge- 
waschen, iiber Na 2 S04 getrocknet und im Vakuum eingeengt. Umkristallisation aus 

20 Ethanol lieferten 3,03 g 17-Pseudoaglycon von Spinosyn A/D und Mutterlauge (C). 
Das Filtrat (B) wurde mehrmals mit Dichlormethan extrahiert. Die Extrakte wurden 
nacheinander mit gesatt. NaHC0 3 -L5sung und ges&tt. NaCl-L6sung gewaschen, iiber 
Na2S0 4 getrocknet und im Vakuum eingeengt. Der Riickstand wurde vereint mit der 
im Vakuum eingeengten Mutterlauge (C) und durch Saulenchromatographie an 

25 650 cm 3 Kieselgel (Eluent: Cyclohexan/Essigsaureethylester 1:1, dann 100 % Essig- 
saureethylester) aufgetrennt. Man erhielt neben weiteren 1,76 g 17-Pseudoaglycon 
von Spinosyn A/D 0,78 g (16 %) Aglycon von Spinosyn A. Die Gesamtausbeute von 
17-Pseudoaglycon von Spinosyn A/D betrug 4,79 g (69 %). - a) 17-Pseudoaglycon 
von Spinosyn A/D (82 % A, 18 % D): DC: Rf (Si0 2) Essigsaureethylester) = 0,48. 

30 -*H-NMR: CDC1 3 , 8 = 6,78 (s, 13-H); 5,88 (d, 5-H von Spinosyn A); 5,80 (m, 6-H 
von Spinosyn A); 5,49 (m, 5-H von Spinosyn D); 4,86 (d, l'-H); 4,70 (m, 21-H); 
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4,32 (m, 9-H) u. a. - LC/MS: Elektrospray Positiv; Peak bei RT 40,7 min: m/z = 609 
(100 %) [M+NHU]*, m/z = 641 (10 %) [M+NH4+CH 3 OH] + (Pseudoaglycon von 
Spinosyn A); Peak bei RT 41,4 min: m/z = 623 (100 %) [M+NH4] + , m/z = 655 
(8 %) [M+NH4+CH 3 OH] + (Pseudoaglycon von Spinosyn D). - b) Aglycon von 
5 Spinosyn A: DC: R f (Si0 2 , Essigsaureethylester) = 0,29. - *H-NMR: CDC1 3 , 5 = 6,80 
(s, 13-H); 5,89 (d, 5-H); 5,80 (m, 6-H); 4,70 (m, 21-H); 4,44 (m, 9-H) u. a. - LC/MS: 
Elektrospray Positiv; Peak bei RT 36,8 min: m/z = 420 (100 %) [M+NH4] + , 
m/z = 452 (1 0 %) [M+NH4+CH 3 OH] + . 

10 Darstellung des Aglycon von Spinosyn A/D 

4,30 g (7,29 mmol) Pseudoaglycon von Spinosyn A/D wurden in 190 ml Methanol 
gelost und mit 285 ml 7,2 N H 2 S0 4 versetzt. Nach 3 h ErwSrmen unter RiickfluB 
wurde die abgekuhlte Reaktionsmischung vorsichtig in 1700 ml gesSttigte NaHCCV 

15 Losung gegeben. Man extrahierte mit Diethylether, wusch die Extrakte nacheinander 
mit gesStt. NaHC03-L6sung und gesatt. NaCl-L5sung, trocknete uber Na 2 S04 und 
engte im Vakuum ein. Nach Saulenchromatographie dieses FeststofTes an 650 cm 3 
Kieselgel (Eluent: Cyclohexan/Essigsaureethylester 1:2, dann 100% Essigsaure- 
ethylester) erhielt man 1,88 g (64 %) Aglycon von Spinosyn A. - DC: Rf (Si0 2 , 

20 Essigsaureethylester) = 0,29. - ! H-NMR: CDCI3, 8 = 6,80 (s, 13-H); 5,89 (d, 5-H); 
5,80 (m, 6-H); 4,70 (m, 21-H); 4,44 (m, 9-H) u. a. - LC/MS: Elektrospray Positiv; 
Peak bei RT 36,6 min: m/z = 420 (100 %) [M+NH4] + , m/z = 452 (14 %) 
[M+NH4+CH 3 OH] + (Aglycon von Spinosyn A). 

25 Forosaminylierung des Spinosyn-Aglycons und Ankntipfung eines Trimethyl- 
Rhamnosezuckers an das Spinosvn-Aglvcon durch Biotransformation mit einem 
rekombinanten Streptomvces Stamm, der Spinosvn-Zuckerbiosvnthesegene heterolog 
exprimiert 

30 20 ml R5A Medium (Illing et al., 1989) mit 5 ng Apramycin / ml wurde mit Mycel 
des rekombinanten Stammes S. albus (165-1) oder S. albus (165-8) angeimpft und 
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24 h aerob bei 28°C bebriitet. Diese Kultur wurde mit 50 jig/ml des hergestellten 
Spinosyn-Aglycons (100 [il einer 1 %igen Stammlosung in Methanol; Herstellung 
siehe Kapitel "Herstellung des Spinosyn-Aglycons und 17-Pseudoaglycons aus 
Tracer®", „Gewinnung von Spinpsyn A/D aus Tracer®" und „Darstellung des 
Aglycon von Spinosyn A/D'*) versetzt und ca. 120 h bei 28°C aerob inkubiert. Als 
Kontrolle wurde in gleicher Weise S. albus (pEBZ340; pOJ446-Vektor mit einem ca. 
1,8 kb groBen Spinosyn-PKS tragenden DNA-Fragment aus Cosmid 16-1-8) 
kultiviert und mit Spinosyn- Aglycon versetzt. Die Kulturen wurden nach Inkubation 
zur Abtrennung von Zellmycel zentrifugiert und der Uberstand (20 ml) wurde mit 

25 ml Methanol versetzt. 

Je 35 ml des mit Methanol versetzten Kulturuberstandes wurden lyophylisiert, mit 
15 ml Wasser aufgenommen und zweimal mit je 10 ml Essigs&ureethylester extra- 
hiert. Die vereinten organischen Phasen wurden zur Trockene eingeengt und mit 
15 350 nl Methanol aufgenommen. Ein Aliquot dieser Extrakte wurde mittels LC/MS 
mit Elektrospray Positiv-Ionisation untersucht. 

Der Kulturiiberstand der Anzucht von S. albus (165-1) enthielt eine Verbindung mit 
dem Molekulargewicht eines forosaminyherten Aglycons von Spinosyn A sowie 
20 Spinosyn A. 

Peak 1: RT = 41,0 min: m/z = 544 (100 %) [M+H] + , m/z = 576 (16 %) 
[M+H+CH 3 OH] + (Forosaminyliertes Aglycon von Spinosyn A); LC/MS/MS: 
m/z = 142 (38 %) (Forosamin-Fragment). 



5 



10 
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Forosaminyliertes Aglycon von Spinosyn A 
Molekulargewicht = 543 
Summenformel = C^H^gNOe 



Peak 2: RT = 44,2 min: m/z = 733 (100 %) [M+H] + (Spinosyn A); LC/MS/MS: 
m/z = 142 (21 %) (Forosamin-Fragment). 

5 

Der Kulturiiberstand von S. albus (165-8) enthielt eine Verbindung mit dem Mole- 
kulargewicht eines Forosaminylierten Aglycons von Spinosyn A. 
Peak 1: RT = 40,9 min: m/z = 544 (100 %) [M+H] + , m/z -576 (16 %) 
[M+H+CH 3 OH] + (Forosaminyliertes Aglycon von Spinosyn A); LC/MS/MS: 
10 m/z = 142 (39 %) (Forosamin-Fragment). 

Der Kulturiiberstand von S .albus (pEBZ340) enthielt keine Verbindungen mit MW 
543 und kein Spinosyn A. 

15 Forosaminylierung des Spinosyn- 17-Pseudoaglycons durch Biotransformation mit 
einem rekombinanten Strevtomvces Stamm. der Spinosyn Zuckerbiosvnthesegene 
heterolog exprimiert 

20 ml R5A Medium (Illing et al, 1989) mit 5 n-g Apramycin / ml wurde mit Mycel 
20 des rekombinanten Stammes S. albus (165-1) oder S. albus (165-8) angeimpft und 
24 h aerob bei 28°C bebriitet. Diese Kultur wurde mit 50 ^ig/ml des hergestellten 17- 
Pseudoaglycons von Spinosyn (100 jil einer 1 %igen Stammlosung in Methanol; 
Herstellung siehe Kapitel "Herstellung des Spinosyn- Aglycons und 17-Pseudo- 
aglycons aus Tracer®'*, „Gewinnung von Spinosyn A/D aus Tracer®" und ,J)ar- 
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stellung des 17-Pseudoaglycon von Spinosyn A/D") versetzt und ca. 120 h bei 28 °C 
aerob inkubiert. Die Kulturen wurden nach Inkubation zur Abtrennung von 
Zellmycel zentrifugiert und der Uberstand (20 ml) wurde mit 25 ml Methanol ver- 
setzt. 

5 

Je 35 ml des mit Methanol versetzten Kulturiiberstandes wurden lyophylisiert, mit 
15 ml Wasser aufgenommen und zweimal mit je 10 ml Essigsaureethylester extra- 
hiert. Die vereinten organischen Phasen wurden zur Trockene eingeengt und mit 
350 nl Methanol aufgenommen. Ein AUquot dieser Extrakte wurde mittels LC/MS 
1 0 mit Elektrospray Positiv-Ionisation untersucht. 

Der Kulturiiberstand von 5. albus (165-1) enthielt Spuren von Spinosyn A und D. 
Peak 1: RT = 44,2 min: m/z = 733 (100 %) [M+H] + (Spinosyn A); LC/MS/MS: 
m/z = 142 (8 %) (Forosamin-Fragment). 
15 Peak 2: RT = 44,7 min: m/z = 747 (100 %) [M+H] + (Spinosyn D); LC/MS/MS: 
m/z = 142 (37 %) (Forosamin-Fragment). 

Der Kulturiiberstand von S. albus (165-8) enthielt Spuren von Spinosyn A und D. 
Peak 1 : RT = 44,1 min: m/z = 733 (100 %) [M+H] + (Spinosyn A). 
20 Peak 2: RT= 44,7 min: m/z z = 747 (100 %) [M+H] + (Spinosyn D). 

Hinterlegung von Mikroorganismen 

Folgende Mikroorganismen und Plasmide sind bei der Deutschen Sammlung von 
25 Mikroorganismen und Zellkulturen GmbH (DSMZ), Mascheroder Weg lb, D- 38124 
Braunschweig, in Ubereinstimmung mit den Bestimmungen des Budapester Vertra- 
ges hinterlegt worden. 
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MikroorpanisTnus und Plasmid 


Hinterlegungs- 
nummer 


Datum 


F co/i XL 1 -Blue MRF'mit Cosmid 16-1-8 


DSM 12961 


1999-08-02 


£ cfl/i XL 1 -Blue MRF'mit Cosmid 16-2-2 

J^j. wL/i* fl 1 1 J JJlUV 1TU\1 1111 I \^l/dllllU A V A* 


DSM 12962 


1999-08-02 


F ro/i XL 1 -Blue MRF'mit Cosmid 16-59-1 


DSM 12963 


1999-08-02 


E. coli XLl-Blue MRF mit Cosmid 16-59-6 


DSM 12964 


1999-08-02 


E. coli XLl-Blue MRF'mit Cosmid 16-59-8 


DSM 12965 


1999-08-02 


E. coli XLl-Blue MRF'mit Cosmid 165-1 


DSM 13005 


1999-08-18 


E. coli XLl-Blue MRF'mit Cosmid 165-8 


DSM 13007 


1999-08-18 


K coli DH10B mit dem BAC Shuttle-Klon P8 / Gl 1 


DSM 13012 


1999-08-18 


E. coli DH10B mit dem BAC Shuttle-Klon PI 1 / BIO 


DSM 13011 


1999-08-18 


E. coli DH10B mit dem BAC Shuttle-Klon PI 1 / G6 


DSM 13010 


1999-08-18 
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Patentanspruche 

1. Nucleinsaure, welche zumindest eine Region umfasst, die fur eine Enzym- 
aktivitat codiert, welche an der Biosynthese von Spinosynen beteiligt ist. 

5 

2. Nucleinsaure gemaB Anspruch 1, dadurch gekennzeichnet, dass es sich um 
einzelstrangige oder doppelstrangige DNA oder RNA handelt. 

3. Nucleinsaure gemaB Anspruch 2, dadurch gekennzeichnet, dass es sich um 
1 0 ein DNA-Fragment handelt 

4. Nucleinsaure gemaB Anspruch 3, dadurch gekennzeichnet, dass sie alle 
Regionen umfasst, die fur Enzymaktivitaten codieren, welche an der Bio- 
synthese von Spinosynen beteiligt sind. 

15 

5. Nucleinsaure gemaB einem der Anspruche 1 bis 4, dadurch gekennzeichnet, 
dass es sich um Enzymaktivitaten von Polyketidsynthasen, Methyltrans- 
ferasen, Glycosyltransferasen, Epimerasen, Aminotransferase!!, Dimethyl- 
transferasen, Reduktasen, Dehydratasen und/oder Cyclisierungsenzymen 

20 handelt. 

6. Nucleinsaure gemaB einem der Anspruche 1 bis 5, dadurch gekennzeichnet, 
dass sie aus Saccharopolyspora spinosa stammt. 

25 7. Nucleinsaure gemaB Anspruch 1, umfassend zumindest eine Sequenz ausge- 
wahlt aus 



30 



(a) den Sequenzen gemaB SEQ ID NOS: 1, 2, 3, 4, 5, 6, 7, 9, 11, 13, 15, 
17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 52 
oder 54, 
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(b) zumindest 14 Basenpaare langen Teilsequenzen der unter (a) defi- 
nierten Sequenzen, 

(c) Sequenzen, welche an die unter (a) definierten Sequenzen hybri- 
5 disieren 

(d) Sequenzen, welche eine zumindest 70 %ige Identitat zu den unter (a) 
definierten Sequenzen aufweisen, 

10 (e) Sequenzen, welche zu den unter (a) definierten Sequenzen komple- 

mentar sind, und 

(g) Sequenzen, welche aufgrund der Degeneriertheit des genetischen 
Codes fiir dieselbe AminosSuresequenz kodieren wie die unter (a) bis 
15 (d) definierten Sequenzen. 

8. Nucleinsaure gemaB Anspruch 7, dadurch gekennzeichnet, dass sie die 
Sequenz gemSfl SEQ ID NOS: 1 bis 6 umfasst. 

20 9. Nucleinsaure gemSfl Anspruch 7, dadurch gekennzeichnet, dass sie die 
Sequenz gem&B SEQ ID NO: 4 umfasst. 

10. Nucleinsaure gemafl Anspruch 7, dadurch gekennzeichnet, dass sie die 
Sequenz gemaB SEQ ID NOS: 5 und 6 umfasst. 



25 



11. Nucleinsaure gemaB Anspruch 7, dadurch gekennzeichnet, dass sie zumindest 
eine Sequenz gemaB SEQ ID NOS: 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 
31, 33, 35, 37 oder 39 umfasst. 
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12. 



Nucleinsaure gemaB Anspruch 7, dadurch gekennzeichnet, dass sie zumindest 
eine Sequenz gemaB SEQ ID NOS: 41, 43, 45, 47 oder 49 umfasst. 
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13. Regulatorische Region, welche die Transkription einer Nukleinsaure gemaB 
einem der Anspriiche 1 bis 7 in Saccharopolyspora spinosa kontrolliert. 

5 14. DNA-Konstrukt umfassend eine Nucleinsaure gemaB einem der Anspriiche 1 
bis 12 und zumindest einen heterologen Promotor. 

15. Vektor umfassend zumindest eine Nucleinsaure gemaB einem der Anspriiche 
1 bis 12, eine regulatorische Region gemaB Anspruch 13 oder ein DNA- 

10 Konstrukt gemaB Anspruch 1 4. 

16. Vektor gemaB Anspruch 15, dadurch gekennzeichnet, dass die Nucleinsaure 
funktionell mit regulatorischen Sequenzen verkniipft ist, welche die Ex- 
pression der codierenden Regionen der Nucleinsaure in pro- oder eukaryo- 

1 5 tischen Zellen gewahrleisten. 

17. Vektor gemaB Anspruch 15 oder 16, dadurch gekennzeichnet, dass es sich um 
einen BAC- Vektor, PAC- Vektor oder einen zu BAC- oder PAC-Vektoren 
funktionell gleichwertigen Vektor handelt. 

20 

18. Vektor gemaB Anspruch 17, dadurch gekennzeichnet, dass es sich um einen 
Vektor handelt, der den BAC-Klonen mit den Hinterlegungsnummern DSM 
13010, DSM 13011 oder DSM 13012 entspricht. 

25 19. Vektor gemaB einem der Anspriiche 15 bis 18, dadurch gekennzeichnet, dass 
es sich um einen Shuttle- Vektor handelt, der sowohl in Prokaryonten als auch 
in Eukaryonten iibertragbar ist. 



20. 
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Vektor gemaB einem der Anspriiche 15 bis 19, dadurch gekennzeichnet, dass 
es sich um einen Shuttle- Vektor handelt, der sowohl in Gram-negative und in 
Gram-positive Bakterien als auch in Archea iibertragbar ist. 
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21. Vektor gemaB einem der Anspriiche 15 bis 19, dadurch gekennzeichnet, dass 
es sich urn einen Shuttle-Vektor handelt, der sowohl in Escherichia cpli als 
auch in Actinomyceten ubertragbar ist. 

5 

22. Vektor gemaB Anspruch 21, dadurch gekennzeichnet, dass es sich urn einen 
Shuttle-Vektor handelt, der sowohl in Escherichia coli als auch in 
Streptomyces Ubertragbar ist. 

10 23. Vektor gemaB einem der Anspriiche 15 bis 22, dadurch gekennzeichnet, dass 
er in einem Prokaryonten autonom replizierbar ist. 

24. Vektor gemaB einem der Anspriiche 15 bis 22, dadurch gekennzeichnet, dass 
er in einem Prokaryonten unter Beteiligung des Phagen OC31-, des pSAM2- 

1 5 oder des Mini-Circle-Integrationsmechanismus ins Genom integrierbar ist. 

25. Vektor gemaB einem der Anspriiche 15 bis 22, dadurch gekennzeichnet, dass 
er in einem Prokaryonten durch RecA-vermittelte Rekombination ins Genom 
integrierbar ist. 

20 

26. Vektor gemaB einem der Anspriiche 15 bis 22, dadurch gekennzeichnet, dass 
er in einem Prokaryonten durch RecE- und RecT-vermittelte Rekombination 
ins Genom integrierbar ist. 



25 27. Wirtszelle enthaltend eine Nucleinsaure gemaB einem der Anspriiche 1 bis 12, 
eine regulatorische Region gemaB Anspruch 13, ein DNA-Konstrukt gemaB 
Anspruch 14 oder zumindest einen Vektor gemaB einem der Anspriiche 15 bis 
26. 

30 28. Wirtszelle gemaB Anspruch 27, dadurch gekennzeichnet, dass es sich um eine 
pro- oder eukaryotische Zelle handelt. 
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29. Wirtszelle gemafl Anspruch 28, dadurch gekennzeichnet, dass die prokaryo- 
tische Zelle zur Gruppe der Actinomyceten, bevorzugt zur Gruppe der Strep- 
tomyceten gehort. 

5 

30. Wirtszelle gemafl Anspruch 28, dadurch gekennzeichnet, dass die eukaryo- 
tische Zelle eine Pflanzenzelle ist. 

31. Polypeptid, welches von einer Nucleinsaure gemafl einem der Anspriiche 1 
10 bis 7 codiert wird. 

32. Polypeptid gemafl Anspruch 31, dadurch gekennzeichnet, dass es die Aktivi- 
tat einer Methyltransferase aufweist. 

15 33. Polypeptid gemafl Anspruch 32, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemafl SEQ ED NOS: 8, 12, 14, 18 oder 20, oder eine Teil- 
sequenz davon aufweist. 

34. Polypeptid gemafl Anspruch 31, dadurch gekennzeichnet, dass es die Aktivi- 
20 tat einer Glycosyltransferase aufweist. 

35. Polypeptid gemSB Anspruch 34, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemafl SEQ ID NOS: 10 oder 30, oder eine Teilsequenz davon 
aufweist. 



25 



36. Polypeptid gemafl Anspruch 31, dadurch gekennzeichnet, dass es die Aktivi- 
tat eines C-C verkmipfenden Enzyms, das Cyclisierungsreaktionen durch- 
fiihrt, aufweist. 



30 



37. 



Polypeptid gemafl Anspruch 36, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemafl SEQ ID NO: 16 oder eine Teilsequenz davon aufweist. 
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38. Polypeptid gemaB Anspruch 31, dadurch gekennzeichnet, dass es die Aktivi- 
tat eines Enzyms, das an Cyclisierungsreaktionen beteiligt ist, aufweist. 

5 39. Polypeptid gemaB Anspruch 38, dadurch gekennzeichnet, dass es die Amino- 
s&uresequenz gemaB SEQ ID NO: 22 oder eine Teilsequenz davon aufweist. 

40. Polypeptid gemaB Anspruch 31, dadurch gekennzeichnet, dass es die 
Aktivitat einer 2,3-Reduktase aufweist. 

10 

41. Polypeptid gemaB Anspruch 40, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemaB SEQ ID NO: 24 oder eine Teilsequenz davon aufweist. 

42. Polypeptid gemaB Anspruch 31, dadurch gekennzeichnet, dass es die 
1 5 Aktivitat einer 2,3 -Dehydratase aufweist. 

43. Polypeptid gemaB Anspruch 42, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemaB SEQ ID NO: 26 oder eine Teilsequenz davon aufweist. 

20 44. Polypeptid gemaB Anspruch 31, dadurch gekennzeichnet, dass es die Aktivi- 
tat einer Thioesterase aufweist. 

45. Polypeptid gemafl Anspruch 44, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemaB SEQ ID NO: 28 oder eine Teilsequenz davon aufweist. 



25 



46. Polypeptid gemaB Anspruch 31, dadurch gekennzeichnet, dass es die 
Aktivitat einer 3,4-Dehydratase aufweist 



47. 

30 



Polypeptid gemaB Anspruch 46, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemaB SEQ ID NO: 32 oder eine Teilsequenz davon aufweist. 
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48. Polypeptid gemafl Anspruch 31, dadurch gekennzeichnet, dass es die Aktivi- 
tat einer 4- Aminotransferase aufweist. * * 

49. Polypeptid gemaB Anspruch 48, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemafl SEQ ID NO: 34 oder eine Teilsequenz davon aufweist 

50. Polypeptid gemafl Anspnich 31, dadurch gekennzeichnet, dass es die Akti- 
vitat einer N-Dimethyltransferase aufweist. 

51. Polypeptid gemaB Anspruch 50, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemafl SEQ ED NO: 36 oder eine Teilsequenz davon aufweist. 

52. Polypeptid gemafl Anspruch 31, dadurch gekennzeichnet, dass es die Akti- 
vitat einer 3,4-Reduktase aufweist. 

53. Polypeptid gemaB Anspruch 52, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemafl SEQ ID NO: 38 oder eine Teilsequenz davon aufweist. 

54. Polypeptid gemafl Anspruch 31, dadurch gekennzeichnet, dass es die Aktivi- 
tat eines Transkriptions-Regulators aufweist. 

55. Polypeptid gemafl Anspruch 54, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemafl SEQ ID NO: 40 oder eine Teilsequenz davon aufweist. 

56. Polypeptid gemafl Anspruch 31, dadurch gekennzeichnet, dass es die Akti- 
vitat einer Polyketidsynthase aufweist. 



57. 



Polypeptid gemafl Anspruch 56, dadurch gekennzeichnet, dass es die Amino- 
sauresequenz gemafl SEQ ID NOS:: 42, 44, 46, 48 oder 50, oder eine Teil- 
sequenz davon aufweist. 
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58. Polypeptid gemaB Anspruch 31, dadurch gekennzeichnet, dass es die Aktivi- 
tat einer Glucose-Dehydratase aufweist. 

59. Polypeptid gemaB Anspruch 58, dadurch gekennzeichnet, dass es die Amino- 
5 sauresequenz gemaB SEQ ID NO: 53 aufweist. 

60. Polypeptid gemaB Anspruch 31, dadurch gekennzeichnet, dass es die Akti- 
vitat einer 3,5-Epimerase aufweist. 

10 61. Polypeptid gemaB Anspruch 60, dadurch gekennzeichnet, dass es die Amino- 
s£uresequenz gemSB SEQ ID NO: 55 aufweist. 

62. Enzyme, die an Cyclisierungsreaktionen beteiligt sind, dadurch gekenn- 
zeichnet, dass sie die Aminosauresequenz gemaB SEQ ID NO: 15 oder 22, 

1 5 oder eine Teilsequenz davon, welche zumindest noch eine Teilreaktion durch- 

fiihren kann, umfassen oder eine mindestens 50%ige Identitat dazu auf 
Amino s aureeb ene aufweisen. 

63. Antikorper, welcher spezifisch mit einem Polypeptid gemaB einem der An- 
20 sprttche 3 1 bis 62 reagiert. 

64. Verfahren zur Herstellung einer Nucleinsaure gemaB einem der Anspruche 1 
bis 7, umfassend die folgenden Schritte: 

25 (a) Vollstandige chemische Synthese auf an sich bekannte Weise oder 

(b) chemische Synthese von Oligonucleotiden, Markieren der Oligo- 
nucleotide, Hybridisieren der Oligonucleotide an DNA einer geno- 
mischen oder cDNA-Bank, die ausgehend von genomischer DNA 
30 bzw. mRNA aus S. spinosa hergestellt wurde, Selektieren von 
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positiven Klonen und Isolieren der hybridisierenden DNA aus positiv- 
en Klonen oder 

(c) chemische Synthese yon Oligonucleotiden und Amplifizierung der 
5 Ziel-DNA mittels PCR. 

65. Verfahren zur Herstellung eines Polypeptids gemaB einem der Anspriiche 31 
bis 62, umfassend die folgenden Schritte: 

10 (a) Kultivieren einer Wirtszelle gemaB einem der Anspriiche 27 bis 30 

unter Bedingungen, die die Expression der Nucleinsaure gemaB einem 
der Anspriiche 1 bis 7 gewahrleisten, oder 

(al) Exprimieren einer Nucleinsaure gemaB einem der Anpriiche 1 bis 7 in 
1 5 einem in v/fro-System, und 

(b) Gewinnen des Polypeptids aus der Zelle, dem Kulturmedium oder 
dem in vi/ro-System. 

20 66. Verfahren zum Herstellen von Spinosyn, Spinosyn-Vorstufen oder Spinosyn- 
Derivaten umfassend die folgenden Schritte: 

(a) Kultivieren einer Wirtszelle gemaB einem der Anspriiche 27 bis 30 
unter Bedingungen, die die Expression der Nucleinsaure gemaB einem 

25 der Anspriiche 1 bis 7 gewahrleisten und 

(b) Gewinnen des Spinosyns, der Spinosyn- Vorstufe oder des Spinosyn- 
Derivates aus der Zelle oder dem Kulturmedium. 



30 67. 



Verfahren zum Herstellen von Spinosyn-Derivaten, einschlieBlich Spinosyn- 
Vorstufen, umfassend die folgenden Schritte: 
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(a) Austauschen zumindest einer Modul-codierenden Nucleinsaure- 
sequenz gemaB Anspruch 7 gegen zumindest eine andere Modul- 
codierende Nucleinsauresequenz gemaB Anspruch 7, oder 

5 

(b) Austauschen zumindest einer Modul-codierenden Nucleinsaure- 
sequenz gemaB Anspruch 7 gegen zumindest eine andere Modul- 
codierende Nucleinsauresequenz aus S. spinosa, oder 

10 (c) Austauschen zumindest einer Modul-codierenden Nucleinsaure- 

sequenz gemaB Anspruch 7 gegen zumindest eine andere Modul- 
codierende Nucleinsauresequenz aus einem anderen Organismus als S. 
spinosa, oder 

15 (d) Austauschen zumindest einer Domanen-codierenden Nucleinsaure- 

sequenz gemaB Anspruch 7 gegen zumindest eine andere Domanen- 
codierende Nucleinsauresequenz gemaB Anspruch 7, oder 

(e) Austauschen zumindest einer Domanen-codierenden Nucleinsaure- 
20 sequenz gemaB Anspruch 7 gegen zumindest eine andere Domanen- 

codierende Nukleinsauresequenz aus S. spinosa, oder 

(f) Austauschen zumindest einer Domanen-codierenden Nucleinsaure- 
sequenz gemSB Anspruch 7 gegen zumindest eine andere Domanen- 

25 codierende Nucleinsauresequenz aus einem anderen Organismus als S. 

spinosa, oder 

(g) Austauschen einer ersten Acyltransferase-codierenden Nucleinsaure- 
sequenz gemaB Anspruch 7 gegen eine zweite Acyltransferase- 

30 codierende Nucleinsauresequenz gemaB Anspruch 1, wobei die zweite 
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Acyltransferase eine unterschiedliche Substratspezifitat aufweist als 
die erste Acyltransferase, oder 

(h) Austauschen einer ersten Acyltransferase-codierenden NucleinsSure- 
5 sequenz gemaB Anspruch 7 gegen eine zweite Acyltransferase- 

codierende Nucleinsauresequenz aus S. spinosa, wobei die zweite 
Acyltransferase eine unterschiedliche Substratspezifitat aufweist als 
die erste Acyltransferase, oder 

10 (i) Austauschen einer ersten Acyltransferase-codierenden Nucleinsaure- 

sequenz gemaB Anspruch 7 gegen eine zweite Acyltransferase- 
codierende Nucleinsauresequenz aus einem anderen Organismus als S. 
spinosa, wobei die zweite Acyltransferase eine unterschiedliche 
Substratspezifitat aufweist als die erste Acyltransferase, oder 

15 

(j) Deletieren zumindest einer Domanen-codierender Nucleinsaure- 
sequenz gemaB Anspruch 7, oder 

(k) Integratieren zumindest einer Domanen-codierenden Nucleinsaure- 
20 sequenz gemaB Anspruch 7 in eine Modul-codierende Nucleinsaure- 

sequenz gemaB Anspruch 7, oder 

(1) Mutagenisieren zumindest einer Domanen-codierenden Nucleinsaure- 
sequenz gemaB Anspruch 7, 



25 



und Exprimieren der rekombinierten Nucleinsauresequenz in einer Wirtszelle 
unter Bedingungen, welche die Synthese eines Spinosyn-Derivates oder einer 
Spinosyn-Vorstufe erlauben. 
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68. Venvendung einer Nucleinsaure gemaB einem der Anspriiche 1 bis 7 zum 
Identifizieren, Inaktivieren und/oder Modifizieren von Genen der Spinosyn- 
Biosynthese. 

5 69. Verwendung einer Nucleinsaure gemaB einem der Anspriiche 1 bis 7 zum Er- 
zeugen einer Bibliothek aus Polyketidsynthasen. 

Verfahren zum Anfiigen eines Forosamin-Zuckerrestes an das Spinosyn- 
Aglycon oder an das Spinosyn-17-Pseudoaglycon oder an ein Polyketid- 
Aglycon, umfassend die folgenden Schritte: 

Ubertragen einer Nucleinsaure gemSB SEQ ID NOS: 23, 25, 29, 31, 
33, 35 und 37 in eine Wirtszelle, die das Spinosyn-Aglycon oder das 
Spinosyn-17-Pseudoaglycon oder das Polyketid- Aglycon herstellen 
kann, oder 

Ubertragen einer Nucleinsaure gemaB SEQ ID NOS: 23, 25, 29, 31, 
33, 35 und 37 in eine Wirtszelle, die das Spinosyn-Aglycon oder das 
Spinosyn-17-Pseudoaglycon oder das Polyketid- Aglycon nicht her- 
stellen kann und Zufugen des Spinosyn-Aglycons oder des Spinosyn- 
17-Pseudoaglycons oder des Polyketid- Aglycons zum Kulturmedium, 
und 

Kultivieren der Wirtszelle unter Bedingungen, die zu einem aktiven 
Stoffwechsel der Zelle fiihren. 



70. 

10 



(a) 



15 



(al) 



20 



Cb) 



25 



71. 



Verfahren zum Anfiigen eines Trimethyl-Rhamnose-Zuckerrestes an das 
Spinosyn-Aglycon oder an das Spinosyn-9-Pseudoaglycon oder an ein 
Polyketid-Aglycon, umfassend die folgenden Schritte: 
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(a) Ubertragen einer Nucleinsaure gemaB SEQ ID NO: 7, 9, 11, 13, 17 
und/oder 19 in eine Wirtszelle, die das Spinosyn-Aglycon oder das 
Spinosyn-9-Pseudoaglycon oder das Polyketid-Aglycon herstellen 
kann, oder 

(al) Ubertragen einer Nucleinsaure gemaB SEQ ID NO: 7, 9, 11, 13, 17 
und/oder 19 in eine Wirtszelle, die das Spinosyn-Aglycon oder das 
Spinosyn-9-Pseudoaglycon oder das Polyketid-Aglycon nicht her- 
stellen kann und Zufugen des Spinosyn-Aglycons oder des Spinosyn- 
9-Pseudoaglycons oder des Polyketid-Aglycons zum Kulturmedium, 
und 

(b) Kultivieren der Wirtszelle unter Bedingungen, die zu einem aktiven 
Stoffwechsel der Zelle fuhren. 

Verfahren gemaB Anspruch 71, dadurch gekennzeichnet, dass im Schritt (a) 
NucleinsSuren gemaB SEQ ID NOS: 9, 11, 13 und 17 ubertragen werden. 
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SEQUENZPROTOKOLL 
<110> Bayer Aktiengesellschaf t 

<120> Nucleinsauren, die fur Enzymaktivitaten der 
Spinosyn-Biosynthese codieren 

<130> Le A 33 955 

<140> 
<141> 

<150> DE 199 40 596.4 
<151> 1999-08-27 

<150> DE 199 57 268.2 
<151> 1999-11-29 

<160> 55 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 50000 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 1 

ggcaccaccc cggggccccc gcctcactcg gagtcgatcc gcattccggt cacgaacatg 60 

tgacccgtcg agtgcgtgat cattagctcg gtcttcgcct cgatcgccgc ctgctgcggg 120 

gtcacgccgc aggcccagaa caccggcacg tcgcccggct ccatacgcac tcgttcaccc 180 

cagtcaggtt catcaacgtt gtcgatgccc aacttctcag gggagccgat gtgcacgggg 240 

cccccgtgca cctcgggatg cgcatgcgtg atggtgacag cctccgccac ccggtccggg 3 00 

ggcatgggac gcattgaaac cactagcttc ccccggaact ggcctgcggg cacacactca 3 60 

cggttgctga ggtaaaccgg gccgttgatc ccctggtcga tgtgccggat cgggagtccg 420 

gcatcgagca gcaagtgctc gaaggagaag ctgcagccaa gcaggaacgc gaccatgtcg 4 80 

tcggtccaca ggtcaacgac gtcactgcgc tcctccacca gcgccccttc ccggtacacg 54 0 

ctgtatttgg gcaggtcggt gcggaggtca gcaccagggg cggtcagctt cggctccgga 600 

tctcccgcgt cggtgacctc gaggatcggg cacggcttgg ggttccgctg ggcgaaccgc 660 

atgaagtcga acgcgtcatc ctggggcagc atcacgaggt tggcctgcac gaatccatga 720 

cagatcccgg acgtgggcct cgcccacgca ttcgcgcggt tggcacgcca tgcctcgatg 780 

ggctccgcgt gctgcagatc aacgaactcc atcgatgact tccccttccc ggcgatgatg 840 

acgaaggctg gcaaccgagg gttgcccgct acgggtggcc gtcactggtc tgccacccag 900 

gggtcgccga gcaggctctg gtgaacgatt ccggagatca gctcggcatg cgcggcgacg 960 

aaacgcagtg ggatcccgtc gtcgccttgg aacgcgggat cgtcaacgcc gccgctgcgt 1020 

agacctgcgc gcagccccag ctcggtccac tcgcgcacca gatccccggt gactgccagg 1080 

tttgcgcgtc gagccaggat ggccaccgca ccggtgatga cgtagcagcc caaattcgac 1140 

acggatgcag gcagcagatg atccaccgtc gtggacgcga agaatccgcc ctcaacgctg 1200 
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acgcctcgcg gagtcagcgc ctcggactcg gcggcaatgt tgccgaaccc gatctcgttg 1260 
ccgttgtctc ccacggccag cgtcagagca ccgttgtccg tggcggcgcg aaggtagtcg 1320 
tcggcaaaca gatcaccggc ggtgaccggc gtgccccaga tcagatgacg gccgccaaca 1380 
gagttctgcc ccagcttctc gatggcggcg acgacgccga actgcgccgc gaactcccgg 1440 
gcctcggccg cgctggagaa tcgggactcc atcaaggtca cgccatccag gcccgcagcg 1500 
tcgaccagct tctccaccgg gccgacggcc tctgggtcga tgacgatggt cacctcgcta 1560 
ccaagaccgg cgagcgtgcg cgcgagtgcc agagagccga tggggccgtc cacctccccc 162 0 
tgcgggaacc ggccttcctc gaccagaccc gtgacgatca acgccttttc accctgtcga 1680 
cgcagcagag agcgcgcgat cgcaaggctg atcggctccg accgctcacc ccgcgccagc 174 0 
gtgtagaacg ggctcacgat gccggccggg ctcgtcggcg gacgcatctc gatcgtggcc 1800 
agtcgatcca ggctctcgaa ggccgggtgc gactcgggct tcatcagctc tccctcggta 1860 
tacgacatat cccaaggact tcatccgcta cggagtcccg cctcgggcgg ttgcttacgc 1920 
ggtgaggacc atataccgta ttccactgtc caaggaaggg ggacgatgga accgaatcag 1980 
ccacgcactg cgggcacgct gaacatggtg gagctgcacc atcgcccagc cgccgcagac 204 0 
tccgcggtgg ggtccggcaa cggtatgctc gcacccctcg acgcacgcgt cgacaatctg 2100 
acagatcggg tcttcgaggc catccgcgac agcatcgtaa gtcaggcact cccacctggg 2160 
cgtcgtgtga gcgaggccaa gctcgcggcc gagctgaatg tcagcaagac ccccgtgcgc 2220 
gaggcgcttc tccgcttgcg gcacgtgggc ctggtcgagg ctgccccgaa ggggctgcgc 2280 
gtcatcagcg cctcggtccg cgccattcgc gacgcctacg agcaccgggc ggggctggag 234 0 
agctcggcag cctggttcac cgcacatcgg gctaccagcg cccaacggga gaggctccac 2400 
gaggtcgcca cctcatccct gcgcgcggca gagtccggcg actccgccca gttccgcgcc 2460 
gaggatcgcg tcttccacca catcatcgcc gagttctgcg ggaacgagat cctccagcgc 2520 
gcggtcgaca acgccctggt gctgaccgct gtactccgag aacgcgacac tccctcgacc 2580 
ggtgactcca tctcctgcgc ctcggagcac atcgcgacgg catccgccgt tctggctggc 264 0 
gacgccgacc gggcagcccg gaacaacgcc gaccacatcc tccacgtgat gtcgcttgtc 2700 
ctctcctcac acacgcagcg cgggaacgtg cccccgggaa cgagttaaac caactccccc 2760 
accccctacc tcaccattcg accgaaggac gggctttcgt gaccgagcac ctggcgacca 2820 
tcaccgctgc ggaaggagag atccccgttg tcgtgagcga ccgcggcgtc gcagcaatcc 2880 
gggagataga ccccgagctg gcggtcacga ccatcatgga cgtgatcgca ggagaccacc 294 0 
tcgcgcggct gcgtgagcgt gtcaagaacg cccccgacct cgccttcaca cgcgtggacg 3000 
aggtgcggtt caccgcgccg taccgatccc cccggaagat ctggggcatc ggtctgaact 3060 
acggcgatca cgcagcggac ctgagcgagt cggcgcccga ccagccggcg tccttcgtga 3120 
agtgcgacca cacgatcatc ggcccggatg agccgatcgt gattccggcg cagagtgagc 3180 
gcaccacgtc cgaggctgag atcgggctca tcatcggcag gacttgcgag aacgtcagcg 324 0 
agtccgaggc gatggactac gtgtgggggg tcacgaccat cctggaccag accgccgagg 3300 
acatcttgcg gatcaatccg cgctatctca cccgctcgaa gaacttccgg acgttcttct 3360 
gcttcggacc ggagctcgta ccgctcgatg aggtgataga gaagttcgaa agcttggacg 3420 
acatcgagat ctcgacggtc aagaacggcg atgagttcag gaccaacacg gtcgcccaca 3480 
tgacccacaa gcccgccagc ctcgtgagct tccactcgaa gatgatgccg ctgtttccgg 3540 
gagacatcat ctccaccgga acccctggcg cgctcgtcat cggcgacgga gaccgtgtgg 3600 
agtgccggat cgacggcatc ggtcagctga ccacgacggc gcgaaaggcc actacacgat 3660 
gagcactcac gcggtcgagc agctcgaccg gacccgactc ctgcaatgtg ccgacgacct 3720 
ccgcgagacc gtcctcgacg acgcactcaa gggcgtgccc ctgggtgctc gtgtgctcct 3780 
cggagaggtg gaccagcaga actggaacgt ggcccgcgga gacctcgccc tgccggtgac 3 84 0 
cacactccac gcacgcgcta tgaacgccaa cctggcgact atggaggcgt actgcgagcg 3 900 
caacaacgcg cttctcgccc cgcacggcaa gaccaccatg tcaccccagc tcttcgccct 3 960 
tcaggcgtct cacggtgcgt gggcgttcac ggcagcgacg cccactcagg tggccgtgat 4 020 
gcgccgctac ggggttcagc gcatcatcct ggccaacgaa ctcgtcgatg ctcccgcgct 4080 
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gcgttgggtg actgccgaaa tcgccagcga 
cgatgcggac acggtgcggg cgatgtccga 
cctcaacgtc ctcctcgagg tcggagtgcc 
ctcggcgctc gaggtcgccg cggcggtcgc 
cgaggcctac gagggcctcg tgaccggcgg 
caccttcttc gctggcgtcc ggtcgatcgt 
cgccaaccgc atcatcgtga cggcaggcgg 
actcggctcg tgggacggtg tccgcgacga 
catctcccac gacgccggca agtacgaaaa 
gcaggaaccg ctccggctcc acaacgcact 
ggagcctgat ctcgcgatcc tgtcgatcgg 
gccacagcca cgcgagctgt ttcgcgcgga 
gacgttcaag ctgatggacc agcacgcctt 
tcccggagac atcgtcgctt tcgacatgtc 
cttcatcccg atcatcgacg acgacttcaa 
acggggctca ggacctccta cgggaggctc 
tcagttggac atgaccacca gccacagaga 
acagacgtcg accttcctcc cgcgggaggc 
atcgtggcaa tcgccgggca gtgcggctac 
ctggaacctc aggtgcgccg ggcattcgag 
tgcagcgagg cagacgtgat caccgtgaat 
gcgatgaacg cgatctaccg cgagtacttc 
accgccggtc tccgacccgg tgtcctgttc 
tgaccgggcc actgccactc ctcccaccaa 
tccgggccgc cgtcacccga aaccgcaagg 
agtcgccctc gtactcggag ccggcggcgg 
ccgggaaggc gcgcgtgtgg cgctggccga 
ggcgtccatg accggcgata acatgagcct 
cgatgagaac gtcaccgccg tcgagagcga 
caccggcggc ccgccgccca cccgtgtgtc 
cttcgaggcg atgatcctct ccgtcgtcgc 
cgagcgggga tggggccgga tcatcacgtc 
caacctcggg ttgtcgaacg ccctccgctc 
cggtgaggtg gcaagcgacg gcatcaccag 
tgcgcgcatc gcccagttgg accaggcaaa 
catcgcgagg gaatcagaga gctctattcc 
cgccgacgcg gtcacatttc tcgccagcga 
ccgtgtcgat ggcggactgc tccagaacgt 
cagcgcaacc cgaagccatg tccgctcctc 
aaactcatca cctacgtgcg ctgatctacg 
atcacgatca gcagcacgga cctcgcctcc 
cggttttcgg gtcgccggcg acccagcttt 
tgctgcctgg aaggcgtact tcaggtatgg 
atcgtatctc tggaaagagg tcgtccggtg 
gaaacagcta cgccgacgct acggcatcca 
tccggaaggc cggctcggga gacgtgcagc 
gacgacctgg cactaccagg acagctcgtg 
cctccgatct tgctaacagt caagcggcga 



cagctcgttc gacttctact gcctggtcga 4140 
taccatcgcc gactccggcc acggggtcac 4200 
tggtggcagg tgtggcgtgc gagacctcga 4260 
cgcgaccgag catctccggc tggtcggcgt 4320 
catgacaccg gaggacctcc atgcactcga 43 8 0 
gctcgagctg gcacgtcgcg acctgttcga 4440 
aagctcgtac ttcgatcgtg tcgtcgccgg" 4500 
cgtggaccta gtgcttcgaa gcggctgtta 4560 
gctctcgccg ctcgctggcc gacgggccga 4620 
cactgcatgg gcttccgtgc tgtcccggcc 4680 
gaagcgcgac gcggcccatg acctgacgct 4 74 0 
tggctcgagg cagcggctgc gggaagcgga 4 800 
cgttcgcgtt gcccccgggc tcgcgatcgc 4 860 
ccacccgtgc acggccttcg acaagctccc 4920 
cgtcgtcgac ggcgtcctca cgttcttctg 4980 
cggttgaacc gaccccacaa gctcccgaag 5040 
ggccccgaca ccgtgtcccg tcaggcgttc 5100 
ccttactccg ccgccgtacg catcggcaac 5160 
ctcgccgaca gatccctcgc accgggtggt 522 0 
aacctcatga ccgcgctgcg ttcagccggc 5280 
gtgttcctcg ctgacgggga cgacttcgac 534 0 
agcgagcccg gcccggcccg gacgaccatc 54 00 
gaggtcagcg ctcaggccgt ggtaggcggc 54 60 
tgtccctcca tcaacggcgt gccacgactt 552 0 
acatgtcatg gatctcgcaa tcaatggaag 55 80 
actcggttcg gccattgccg cctcgctaag 5640 
catcgacaag gacgcgctca cggaggtcga 5700 
ggtctgggac ttggcgaacc tgtcggtcat 5760 
gctcggcccg gtggagatcc tggtcaacat 5820 
cgggcaggac gccaccctgt ggcgtcggca 58 80 
gatcacagac cgcgtgctcc cgggcatgac 594 0 
gacttcgtcc ggcgtcatcg cgcccatccc 6000 
gagtctggtg ggctggtcca agacactgtc 6060 
caacatcgtc gttccggggc gcatcgccac 612 0 
ggccgcacgg gagaaccgca ctgcggagaa 6180 
ggtcggccgc tacgggcggc gcgaggaatt 6240 
gcgtgccagc tacatcaccg gatccgttgt 6300 
gtaaccccga caaacatcct gtctccagat 6360 
ggggtcactg acgtcagcga tcccgagccc 6420 
gtcatccagg gctacccact cgccagcggg 64 80 
agggtgcacg ctgcgcgcga gcggtgccgt 654 0 
tccccacatt gaaagctatc cggaaccgga 6600 
atgcccgaac gcgaccttca gctacctacg 6660 
gcaggagcgc aaacaccggc gtactccctg 6720 
gccagccgac ggcgatgtcg ggctgttcga 6780 
cgggtgggga aggagcgctg acaaaatcct 684 0 
ggcaagcgag atcaggcggg gtatgcgaaa 6900 
cccgtttcat ggtcgtgtaa tcctcgcggc 6960 
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cacccgcgct ggcctcacgt ctgtgcgcca gcggaaatgt cgcacaggcg cggtattccc 7020 

gaaaggaacc catcgaatca tgacgaaccg tcccttccgc tggtcgctcg tgggcgccag 7080 

cgacatcgcc cgcgcatttg taggcccagc gatctccgcc cagcctgacg ccgcggtggt 714 0 

cagcgtcgtc agcggcgacc tggcccgagc gcaggatgta gcggcaccat tcggcgcccg 72pp 

gggccatgac gaactcaccg tggcgcttag cgatcctgac g.tcgacgccg tgtacatctc 7260 

gagtatcaac tcggcacatc acgatcaagc acttgcggcg atcgcagccg gcaagcacgt 7320 

cctgtgcgaa aaaccgcttg ccctgaccgt cgccgaggca caagaacttg tcgatgccgc 7380 

ggagaaggca ggcgtagtgt tcgcgacaaa ccaccacatg cgcaactccg tgccacatcg 7440 

tttgatccgg gatgcaatcg ccgccggcga catcggcgag ccggtcgcag tgaacgtccg 7500 

gaacgcgatc cgactgccgc ccgccgcgcg gagatggcgg acgaccgatt cggcagcggg 7560 

cgccggagtc gcactcgacc tcacggtgca cgacgcggac tgtctccggt tcgtactcaa 7620 

tgccgatccg ctaacggtcg tggcccatac ctcgtccggc cacatgacga gggtgggaat 7680 

cgatgagaca atctccggaa gcgcggagtt cgtcagcggt gtgcatgctt cattcgtgga 774 0 

aagcttcgtg accggacatg cccccacatg cctggaggtt ttcggaaccg aaggagcgct 7800 

catcggcacg ggtatccagt cgatgtcacc ggtcggcacg ctgacacacg tcggtccggc 7860 

cgggtccagg aagatcgatc tgggtacccg cgaggacctc tacgtcgtgg gtgttcgacg 7920 

gttcatcgag gctgtgcgtg gcgaggccga gcccccggcc tccggccaag acggggtctg 7980 

gtcgttggcc ttcgcacggg cggcgctgga ggctgctcgg accgggcatc ggcagactgt 8040 

gggacaggct cgatgacgaa agctgggcgc agttaggcac aacagatcat gcacaaggac 8100 

aaccgggcat cggcagacgt ggcgatcgtc gggtccggtg gaggctgact gatcgtcgcc 8160 

ggggcgctcg ctggtcgcgg actcgatgtg ctcgtgttgg aacgggacga gtttttggct 8220 

caagagcccg ctatcgacgt gtgcaaggct tcgcgcacgg gcactacaag ccggacgaac 8280 

ggtggctcga tggtgctggg agaccgttca caccggctac tcactactcg tcggcggcaa 834 0. 

cagccaaggc ctacgaagct ggcccgtggc tcaatctata ccgcgtaagc tcgccaacgg 84 00 

gtcctatgtg ggcggccgca actacatgca gcacccgaac tccgcggtca tggccatcgg 8460 

caatggccag cctccgatca tggaccatac ctacttcaac atgcgccgtt gcgggggccg 8520 

cggacgatta gctgaagcgg atgtccgctt tcggtatcgc agaccttgcc acgacgggcg 8580 

gtgttgccgg gccgcgaacg gatcatgtga gccgggctgg cccgagaagg gcatgggcca 8640 

acagagcagg tgactacgtt atctctgccc ggtcatccaa ctgatgacct gtgcgcgcga 87 00 

ggtaaagccc agcttggaaa ggatgttctc gacgtgggcc tcggctgtgc gctgggaaat 8760 

gaccaggtca gccgcaatct ctttgtttgt tttgccagcg gtgaccaggg cagcgacctc 8820 

gcgttctcgt tttgtgagaa ccggttcggt ttccggagcc gaagcggcgg ccgcttcgac 8880 

cggcggtggc ttctccccaa gagcatagaa gatagtctgc tcccggttca tccgcgtccc 894 0 

ctggcctcgg gcggtttcgt attcggtgtc tgtgagtgct tcctgggcgc gtgccttggc 9000 

ctgctcacgc caggtcagta gcgtctcgga tccgaacagc ggggtgccaa tcagttccca 9060 

cattttatcc aacgccccga aaaggatcgc cgcgcgttca ggattgcgtt ctgcggcggc 9120 

gacccaggcc agcagttcaa cgcagtcgct gatcccgagt aggtcgttga gttcccgctt 9180 

attgcggagg gacctggacg cggaggcggc tgccttggtt gggtcgcccg cggcccacca 9240 

agtgatcgcc atgttccact ccgcccacga tagcctccag cgctcgccaa gttctgtaca 9300 

gatcttgcgg cattcgtgca gaaggaccat cgcccggtca aggtctccaa gcaggcccgc 93 60 

tgcctgcgct cgtattgaga aaatggctaa ggcggggccg gtccagtggc cagttgcgcg 94 20 

gtggcgggcc agtgcctcgt cgagcagttg cgtggcgcga gtcaggttgt tgtcccacat 94 80 

ctcccggtcg ccaagaaact gagtcgcgta ggtaaattcc gtttcgtcgc ccagtttccg 954 0 

ggccagctcg cggctttcgt taagcaggac cagcgcccca tctcggtccc cttgccggtg 9600 

ggcaatccat ccggtgatcc acagtgcgcg ggcacgctgg ctggtgggtt cggtctccag 9660 

ggccaacgcc ctgtccagcc acatgcggcc ctccctcaga tagtcgccag cagcccagta 97 2 0 

gaaccacagc gcgctcacca tccgcaaccc aaccgacgcc tcgccaggtg tggtgaagca 9780 

gtagtccaac gctgcccaga agttgccgcg ctcggcgcgc agacgcgcga gccaaccggc 9840 
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ttgatgcggg cccgcggact cggcatcgga ctgctcggcg agatgcaggt agtagtcgcg 9900 
gtgccggcag cgcagagcgt tttcggcacc cgcctcggtg agccggtcgt gcccgtattg 9960 
acggatagtc tccagcatcc ggtatctcgt cttcgtgcgc tcctcggtcc gggtgagcac 10020 
ggacttgtcg atcagtcccg cgagagcggc gaagacgtca- tcatcggtca gcccgtcgcc 10080 
ggtgcagacg ttttcggctg cgtcgagatc gaattcgtcg gtgaaaaccg aacagcgcgc 1014 0 
ccacaactgc ttctcatgct cattgcacag gttgtagctc cagtcgacgg cggcgcgcag 10200 
ggtctgatgc cgaggatcgt cggcgcggtc gccggtggtg agcagccgga atcggttctc 10260 
cattcgcgtt agaatttgcc cgactgagag cacgcgaacg cgcacggcgg cgagttcgat 10320 
cgctagggga atgccgtcca gcctccggca taacctggcc acgatctgtt cgttttcttc 103 80 
gttgatggtg aaccctggta gcgccgccgc ggcacgctgc tcgaacaagg tcaacgcttc 1044 0 
acgctggtag ggacgacctt cctgttgtga cgttgatgaa tcgctgccag caggcacgga 10500 
cagtggcgga accggccagg tctgttccgc cactatccct aaatgttctc ggctggttgc 10560 
aagtatccgc aaccgcgggg ccgcggccag cagcacagcg actaggcggc tgcaggcccc 10620 
cagcaggtgc tcgcagttgt ccaacaccag caagacttgc ttgtccgcca agtaatcgac 10680 
gagcacggcc tccggggcac gtgtcgaacg gtcatggatc ccaagagcag ccgcgaccgc 1074 0 
ctgcggcacc agcatgggat cgcgaacgtt agccagctcc accagaaatg tcccggcgga 10800 
gaaggcccgc cgcacgtcgt gtgcgacgtg cagcgccagc cgggacttcc cgatcccgcc 10860 
gaaaccagtc agggtcacta accgtgaact ggacagcgcg cgcttaactc ttgtggccgc 10920 
ctgacggcgc cccacaaagc tcgtcacctc atatgggagg cgaccagcct cccgaaccgc 10980 
aaccccagac acgactcacc gcacctcggt agaactagcg ttttattgta gcccggtcat 11040 
ggctggcagc gcacttacag cttgaacaac tgttcgaacc acttcgaacc caccgacggg 11100 
gttcgttacg atctcggccg atgttgatcg tggtgggtgc ggggagggcg tgcgttgccc 11160 
gctggcctgc ccagcctgta ccactgatca aggtcgccga ccgggcccgt gtgggcgatg 1122 0 
ttggcactgg tcaggccact gctggcggtt atgcggcggg ccagctgcgg cggcacggtg 11280 
ggcggcatcc gctccgctac tggtgcgcga aggccgtcgc gtccgcctga cggcgaaagg 1134 0 
tgagctcgtc gtggaggcag cggaccgggt gatcgctgag ctggagacgc tgcgcactca 11400 
gatcaatcgc tcacgtggcg agatctcgca acgtctgcgt gtggcctcca tgcagaccgt 114 60 
caccttcatc ctgttccccg aggtgctgac tcgtctacgc cggcggcgtc caggactgga 11520 
tgtggtgatt gttcaagccg agccggatgc ggctcttcag ggtctccagg tcgacgacta 11580 
cgacctagtg atcgccgagt cctacccgca gtggccggtc atctatccgg acgggctcac 11640 
gacagagatc ctgatgaccg accgcatgca cgtcgccgca gctcccattg tgggtccacc 11700 
gcccgtcggc tgggggcagg cgctttcgcc gctggcgagt cgttcaccgc tctcggcgct 11760 
ccaacgtctt gcctgctcga agttgtcccg agtcgggccc gacgatggct gtggtgaagg 1182 0 
gatcttgacg ctcatgcatg gaggttcacc attgttggag caagaggctg atgacggtgg 11880 
cggcggtcgt tgcgccagcg gcgatcatcc agccgtgtgg gagtagcggg ctggctccga 1194 0 
agaattggct caggccggga gtttggacga cagcggccag tgccacgaac gagacaagcg 12000 
cacacgccat gaccagtggt gtccgtccgc gcacgaccag ggtctgtccg agctgggcgg 12060 
cgaccagcgc cacgagcgcg accgtgttgg cctggcgcag ggttccggtc gccctgccca 12120 
ggatccacgc acaaatggcg gccccggcgg tgaccgcggc gcggcggtag atgtcgcggg 12180 
tcagcgcagc gcccagcgag gcttcggggc cttctgccag cagcatctcc gcagtgatcg 12240 
tcgggggtgg tcgcaccgcg atcgccatcg ccgggaggat gtcggtgagc aggttgacca 12300 
gcagcaactg gcgggcgttg aggctgcccc cgccgttgat cacaccggtg acgacggtga 12360 
agccgatctc gccgaggttg ccgccgagca ggatggccag cgagtcacgc accgatgccc 12420 
acatggcgcg gccttccaca atggcgtcga cgatggtctc gatgcggtca tcggtgacca 12480 
cgaggtcggc ggcctcgcgg gcggcggggg ttgcgcgctc gcccagcgcg atgccgatgt 1254 0 
cggccaaccg gatcgccggg gcgtcgttgg cgccgtcccc ggtcaccgcg accacacggt 12600 
cgttgcgttg cagggcgtcg acgatgcgcg ccttctgcgc cgggctgacc ctcgcgaaca 12660 
ccgcgacggt ggtcacttcc gccgccagtt ggtcgccgtc gagggtgtcc agttcggtgc 12720 
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cggtcatgac gcgacgctcg ttgagcgcgt cgagttcggc ggcgatcgcc tcggcggtgc 12780 
tggggtggtc gccggtgacc atcacgacct cgacacctgc ctgctggagc tgcgcgacgg 12840 
cggcggccgc agtgggccgc accgggtcgg ccagcgcgac gaaaccgatc agccgcagat 12900 
cccggatccg ggactcgtcg agatcactgc ggtccgaggc ggcccgctcc gcgacagcca 12960 
gcacccggta gccctgccgg gcgaggtttt cgacaactcg ctcgacgcct cgacgagcgg 13020 
tctcctccaa cggcacgtcg ccatctgatc tgcgccaccg ggtgcactgg gccagcacga 13080 
tctccggggc acctttcaca ctcagcagat tgccgtcgcg ccaggtgccg agcaccgcgt 13140 
ggtagccacg ggaaggctcg aagggcatct cgtcgatcgg tacccagcgc tgcttgcctt 13200 
cggccgggtg cactcccgcg ccacgcgcgc catcgacgac cgcgcgatcg gtgggatgcg 13260 
ggagcagttc tccgttctcc accaacggac ttgcccgcaa ggcacccgcg agcactcgcc 13320 
gttccactgt ggagatttcc tcgagccacc ggctcgacgc gccatcggat acctgccgca 13380 
gcgcgatgcg cccttcggtg agggtgccgg tcttgtcgaa acacaacacc tgcgcccgcc 13440 
ccaaagcctc gatcgtcgcg gagttacgca ccagcgcccc gcgtttcgac agccgccggg 13500 
cggccgcgag ttcggcgacg gtggcaacga acggcaatcc ctcgggaacg gccgccaccg 13560 
ccaggctcac cgcccgcccc agcgatgtcc ccaatggacg cctacgcagc atgtccgtga 13620 
cgagcagaat cccacccgct cccacgcaga tcggcaccgt caccttggcc agcgtctgca 13680 
ggcgagccgc gacaccatcg gcacctcgcc gggcatcggg cccgatcgcg ctgcctgcct 13740 
cggtgttcgc gccggtcgcg accacgacac ccaccgcccg cccggcagcg accgcggtcc 13800 
cccggtagat catgcacgtc cggtccgcga ccggcaccgc cggagtcgcc gtcaccgtct 13860 
tggtgaccaa catcgactcc ccggtcaggc tcgactcatc gacctccagc ccgtcggcct 13920 
ccagaagccg gcagtcggcc ggaacagcat ccccggccac caactccacc acatcgccga 13980 
gcacgagctc gtcagccggc aactgcgtca cattttcctt acggcgcacc acaacccgaa 14 04 0 
caacgctggc ctccagaagc ctgttcaaag cgcggtccgc ggtgatgcgc tgcacgccgc 14100 
cgatcaacgc gttcatcccc agcacgccca tgatgatcgt cgcgtccgca acggcaccga 14160 
tactcgccga gatcgcggcc ccacccgcca acgccggcgt caacgggctc gccagctcct 14220 
ccagcgacgc ccgcaacaca cccatccggc gaaggcccgt cccctcgtcc tgaccttgga 14280 
aacgctcgtg cgccgcatca tcggtcagac cctgcgatga actgccgagc agctccagca 14340 
ccatgccggg gcccatcgca tgccaggcag tgcgcggcac cgcgcgcgga accggacgcc 14400 
acgcggcctg catcccccac cacgtcccag cgccgagcgc gattgccgca cacacggcca 14460 
ccggaacgcc cgcacggacc tggctcccga tggcaggccc gaacgcgccg aacaccgccc 14520 
cgctgaccga accgatcaac gacagctgcg cgctacgcac gctgacataa cgcgcggtcg 14580 
ggaccgtact caacaacgca cacgactccc ccaggctcgc gcccgagatc aggtgcgcac 14640 
cccacggaac ctcaccacca cgcgtcacac caagcccgac atccgcagcc gccagtgccg 14700 
cccgcgcccg cgcggacacc acaccgacca cgctgccctc ccgctgcatc tcccgcacca 14760 
ccgaagccag ccgcacccca cccgggacca cgcgatcaac accgatccgc cccgccaacc 14820 
ggcccccatc acccgcgaga accaccgacc cgacctcgcg agctgccgcg accaactcct 14880 
cagcgagcgg atccagctcc tcggcagcca gaacccacgc ggccgtgcgc gaatcgtggc 14 940 
gcagcgcgaa cacctcgata ccccgaccgg ccacctcgcg cacgtctggg ggcaaccgcg 15000 
accgagcgac cgagtacagc gaccaaccat cacgctgcgc gccgaccccc ttggtcacac 15060 
ccacgagttc atgcgcacgt tcccacaact cgacgaccgc gcgccccccg tccgcgggaa 1512 0 
tcacgtcatg caccaaccga gcaccggtgc gcagaacgtc cgcgtcgatc accaccgtgt 15180 
cgacccggtc caaccgccgc aacacccggg catcgaacac cagagtcgtc cgctccgaca 15240 
gcccgtaccc gagctgcgcc gcgaacgcgc cccgccccac ctgcgccgcg cgcggcgcgc 15300 
ccgcgaccac catcgcctgc gcacgctgaa aactccgcgt caccccgagc atgcccgcga 15360 
acgccgccgg cccacccaga gccgaaccgt tcgcggcgcg ctcgaccggg ccggcgggca 15420 
acggcacagg acgcggccga acccccaccg gcgcactacg gtgactggca ggatgcgccc 15480 
ccgttccggc ttcccaccgc gcccagctcc gccgcgccgc ctgcgtctcc cgcaacaggc 1554 0 
agaaccggta ccccgattcc gccagcaaac tcgccggacg ccgcgcaagc gtgttcccca 15600 
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acgcactgct aaacccgaac aacaggtcag tggcggtctt cccgatccgc gtttccagcg 15660 
gcccccggac ccaggaggcc gactcaccgg tcgacaccag cgccggaaca gacggcggca 1572 0 



ccagcaccgt cgcctcagcg atcaccttga ccggattgtc cgggtggctc acaccgaccg 15840 



gctcagcgac cccaaccagg cccggatcgt gcccgatgac cacccgcccc acgacagcgt 15960 
tgatctcgac ccacgcaaca cccggcacct tctccaaccg ccgggcgatc tgctcgccag 16020 
cctcctcagc ccccggctgg tcaacaccgc gcacctcgac gtacgcacgc ccctccgacg 16080 
accacgcccg ccgcccgcgc cccaagcccg cgatgctctg cgcgacagcc accaccggcg 16140 
ccacgatctg cgaggccaac gcgcccaagg tacgcagcat cgaacacact cagctgtccg 16200 
acgcaaagcc gaccgacgcc cgtgacaccc ccgcgccggt cgcccccgca accggccatt 16260 
cgagccccca accgacccgt accgcgcgaa ccgcgtcctc ccaacgaccc ggcatgcgaa 16320 
catcgacccg acgaaactgt gccacgacgg gcaggctgac cgccgcggct cgcaaccgcg 163 80 
ccccttgttg ccgcgcgcct gcgccaggcg ttgtgttccg cgccgtccta gccaccacta 16440 
cctcccgcgc gcacagatac cgtgagtcac gagcgtagga ccgacgagcg gaaagcgaac 16500 
ctcgaaacag ccgcacatcc gtcctctcgg tggcctgcga gctgatcagc aacaccgacg 16560 
gcttcgccac gctgttcgac ccgccgacac accagtagtc ccagcggtgg tgaattcaag 16620 
gccgcctgtg tgctcgggtt cttcgaaacg gctccgagca tgcagctacg ccggacgcag 16680 
gcaacgcaac acctacccag gttcacctgg cctgagcgcc gccgcggcgc gctcagcgac 16740 
gcctagcccg gaacccgcta caaaccggat cacttcgacg ccggtccgcc ggcacgctgc 16800 
gcgaccgcga cacccacgcc aatcgccgcc gccaccggcc agtccatcaa ccccaacagg 16860 
gtcagcgctc caagaccacg atagaaggcc atcgtctgat ggtccgggag aacgccccga 16920 
acggcctcag ccgcgctagc aagatcatca cgagtgggaa tgtagtgatc gggacggtgg 16980 
aactcggccg ttacgcccgg cagactcacc gtcgttcccc cagcacccgc cggtttccgc 1704 0 
tgcgtagtaa ccacggcacc tcctcaacac aggccccatg ccgcataatc ggcctcgcac 17100 
ggactcaaaa ccgcccaaac cgaccaaccg ggcaacacga ttttatctct gaacacgaac 17160 
gtccccgcgc cgagacggct ggaatacggg ctgccgccgg acctgaacca gtcaggcgtt 1722 0 
gtgcttcaac aacgttgctg tgcacccacc tgctcgggtt ggccaaccgt gtggacgcaa 172 80 
cgccggctcc agtgactgct ggtgatcaca ctcgttgatc atgatcgtct ggactgcggt 1734 0 
ttgatcaagg caatgttccc gcacctggcc ggggtgcggg agcggctggt gcgccggttc 174 00 
gcggagttca ccaacgagta tccgtggcgg tggacaccgt cgcatgtgga cgaatggtcg 174 60 
caatcaccga ccggagaacg gcatctcgcg ccatccacaa tccgcgccta ccaaggcagt 17520 
ctcaggctgt tcaacgaatt cctctgtgac agccggtatg gctgggcgct ggcgtgcgag 17580 
gcgtcgttcg ggccggacga gtacccggtg gcgaatgtgc acgagtggaa ctcactcgcc 17640 
catctccagt cctatgaggg cgatccggac gaccgcccgt tcacccgcga tgagctgcag 177 00 
cggttccttg actacgccac gaccaagtcg accgcgcggt tcgcgcgaaa cgcaacggcg 17760 
cattggcagc ctatcgagac gcgaccttgt tcaaagtgat gtatcgctag ggcttgttcg 17820 
ccggggccct acccgcgtgc gctcccgcga ttttttgagt ttgtgcagtt cagaggacga 17880 
tgtgggagat gctagccgtc gcgacagatg tggtccaagt aggcgcggca ggcaaacctc 17940 
agcggctcgc gtgcagtatg cggctcgcgc gcagtgcgcg gggtggttcg ccctcgaaac 18000 
ctgcggatcc gtggaggtag ccggcgatga ggtccggggc ggagtggtct tcgatgccgg 18060 
tgaacccgag ggcgtgaagc tgtgcggcga tgtcgtcggt tgtgaagtag ctgaacaagg 18120 
gttcgccggc gccggccagc cgatcggcac gtgcccgctg gtgcgcacgg tcctcgtcgg 18180 
tggccgcagg ctgcaggtag tcgaagacca cctcgaccgg ctgggcttga ccagcgatgt 18240 
attcgagggt ggcgtgggcg gcgttcgcgg tcagatagaa gacaacgccg agccacacga 183 00 
atactgccgg gtccgtccgt ttgaatccgg cggattccaa tccagttgcc agcggctggg 183 60 
tttcgaaatc gaccggtacg aaggtcaacg tctcagggcg gtcgatccct gcgctagcaa 184 2 0 
ggcgttcgcg tttccatgct tgggtggcgg ggtggtcgac ctcgaagacg cgcaagtccg 184 80 



gcacgggcag cggcagcatc 



cgggccccca ccgcgatcac agcgccggtc aagttcgcac 15780 



cagcgaacgg ttcctcgtcc 



aaaccggtgt ccgcctccgc gtcctccacc acacccacca 15900 
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ggcgcggatt gcgataggcg aaggtgtcca 
caccatcggc gatggccgcg gccacggcgt 
acaggcggcg gggctgatcg attgccccgg 
atttggtcag ttcctctgcg gtgacgccga 
tctgtggtcg gtcagcgatc tggtgatagg 
ggctgggtcc gccgttctcc at'gttcaaga 
gaactcactc acgcccgctt cacaaaggag 
ccctccttta gggcgattgc cacgagtcga 
tgggttgatg tgttgagccg ggtcatgagt 
cgccggcgcg ttcgtcgagt tgctgttgct 
tcaacggtca cggtgcgttc ggaccgcgtc 
accgggccct ccccattgat gtggccgctg 
cgccggttcc cgcagcccgc cttccccccg 
acgcgttcac ggggacgatc atcggtccac 
gcggctattc caggggcccg gccgtggttg 
tgggcgcggc atgatgagcg accgtgctgc 
cgtccgtgca ggtcacacag cagcgtctga 
tcacctcacg cagcgaatgg cgcgccagcc 
ccggtagctc ggctggaggt tgcgggatga 
gatgaccgtg cgactccgcg agcgttgcaa 
ggccgttccg cttgaccacc cgcccatcga 
tctgcgcaat gatcgcgcca caagggtcgt 
acatgtccag caatacgatg tcggcccgct 
cgacgcccgc ggcgcgggca ccgttgaccg 
accgcatgac cgtgggcagt gtcgaccgcg 
tcacctgtag gacgagccgt gcgtgcgaaa 
cgatgccgag gcttggcccc ggcgtgttct 
tgcccatctg catctcggtc tcggcgcaaa 
ggtgccattc gttctcagtg ctgaatgtcc 
tgttgtggga gtgcagcgcc tcgatgtcct 
agtggaacgt gatgcgtgcg ccgagctcgc 
cgaagggagc gatggctagt tcctgcgggg 
tcgggaagta ctgggcgcgg atttcccggg 
cttccaccga ctcgtccagc ccgcctttgg 
ggaggccgtg gccgtacagc gcgcgcacgc 
gcgcgtgctc gccggtgacg atgttgtggc 
taaggcaatc gaggccaccg aggtagttgc 
tcgccatctg cagccggaat cccctcaggt 
cccggagccc ggtttgccag gtgtgtcgat 
tcgacgaggc gtcgattcgc tcggtgtcgc 
tgacaccgtt gtcgacgagg atgtcggctc 
tcaccacggc agcgttctcg atcagtgtgc 
ggcatcggat gccgtcctca agcggtcccg 
ctgtggagag gggccgtctc gcagacgggg 
accggcggtt gtcagtcgca tcgtcgcgcg 
tgcgtccgag tggtgaagcc gagtttggtc 
cggcgggaga tcaccatgat ggcggcgatt 
gctgctacct ctcgctcgcg cggagtaagc 



ggcccgcgcc gaggatcacg acctgccgta 1854 0 
cctcggcgaa gcgggcacgg gcggcgaaga 18600 
cgccgaggtg atcggttgtg ggcgtggcca 18660 
gcagacgtac cgccaggggg tcggtgaaga 18720 
cgcgggcgta cgcggtagcg agcgctgtcc 18780 
acttactcac gaacgcggcg cgggctctat 18840 
cacgcgaaca cgatccgtaa ccacagctgg 18900 
caggcagggc tcgaaccctg ttatcggggc 18960 
tcgcggttgg tggccccggc gcggcaagat 19020 
gcggcaagaa acgacagctc ggccgaccgt 19080 
ctgccaccac acctgacggt ctttgtcgtc 19140 
atgctcgaat gcgcacgagt ggtcagtcaa 19200 
gaagacggcg cgttcccgca cggcagcccc 19260 
gaggcgaata cctgctccgt gcaggtggaa 19320 
ctcgaatcgg cgctcgcacg cggctgacgg 193 80 
tgcgactggc cttgcctatg ccgataacct 19440 
cggttttccg gtcgagcccg gccagccggg 19500 
acagctggct tggttctcag cggttgaaca 195 60 
atccgccgtg ctgggccatt tggtcgtaga 19620 
gcgctccgtt gacgtcgacg tgcaccagcc 19680 
ccagcaccgt gtgcacgttg cccgagttgg 19740 
ggcggttcca accggcctgg ctgatgcctg 19800 
tgcccggtgc cagggttcct gtcacgtcgt 19860 
tgagccaccg taaggcgtcg cgcgttttcc 19920 
cgtagtcttc ttgatcgtcc ctccatcggg 19980 
tcatgtcgcc gcaggtactg ctcgtgcagt 2004 0 
cggtggcctc acgaatgacc gggaatccca 20100 
ccgagatcat ggtgccggta ccccgcagca 20160 
cgtgcaccaa gagaaggtcg ctgccgagca 20220 
tgaacaggtt gcgcgcgatg acttggttgc 202 80 
gggccagctc gaattcctgc ttgacgtccc 20340 
cgatgccgaa gcgcagcaat tgttgctcgc 20400 
ccagccgcgc ccgccgggcg aagtcgcctt 204 60 
tctcggacca cgtgttcgac gtgatgggca 20520 
ccgcgtcgcg gagtccagcg accgcggcgt 20580 
agtaatcgac cagcgtggtc acccccgagt 2 0640 
ccgcatacat gtcctgcggg cggtacttcg 20700 
agtcgaggat gttcccgtcc gccagaatgc 2 0760 
gggtgtcgac catgcccggc atggcgatca 2 0820 
caacgttgag gttcggcccg atctcgacga 2 0880 
gttcgaagtc gccgagaacg tcgtccatcg 2 0940 
gcatggccaa gtgctcctct ccgcgcggtg 21000 
ttcggtcgtt gatcgactgt ggagaagagg 21060 
cagcttctac tggtccacgc caatccgaaa 21120 
gtcttctgtt gaatgaccca ggtcgcgatt 21180 
aggatgtgct cgacgtgggc ctcagcggtg 21240 
tctttgttgg tggtgccctg tgcgaccagc 21300 
accgacgcgt gcgtttcttc cgtcctggag 21360 
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cttgctggtt ccccggcttt gcccggattc cgtatccatg cgatggcctc gtcgaagtcg 2142 0 
agctgtgtgg cgtggtgaac gcctttctga aacgtctcgt cgccgagtga gctacgtgct 21480 
actgcctcgt agtggtcgtg taggcgcgca acatgtttgt ggccggcgag cgacgcatcg 2154 0 
acgttctttt tgatcgtttg ggacacgccg aacaaggtcg cagcccgtgc cgggtcaccg 21600 
tctgcgctgg ctatgcaggc cagaatctca acgcacaacg cgatctgaaa ctgctcatgg 21660 
aagccgagcc gcaggcgaag actctccagt tccgcagcat tggcttgccg gtaattgccc 21720 
gctagtaggt ctgctattcc cttcgcccac agggcaagcg atctcatcca ctgttcgctg 21780 
taggtctcct tgacggcagg gcacccgcgg aacagggcag cggcgcgctc atgatcacca 21840 
aggaacgcga ctgccaacgc gagctcgatc tgatcgtagg cggcggcagc gtgatcgcca 21900 
atcagccgat gcttggcgag agcgtcttcg aagagcgcgc tcgcccgcgc cggatcttcg 21960 
ttgaacaggg cggtgagccc cgagatctgc gcgacgtagg ccgcccctga ttggtcgccc 22020 
aactccgccg ccagcgcgct cgcctgctcg agaagggctg acccggatga cgcgtcgttg 22080 
agcgcggcat tcaggtaccc gtcgacgtag agagctttga cacgtaccgg atccgcatcg 22140 
aatcgatggt aaccggacat atgcggctaa tggtgcacaa cgggcagtgt tgatctgacc 22200 
acgctaaccg gtatctcggg cgggtcgcgt gctgcgccgt tcaagctcaa gcgtgcggtg 22260 
cgtcttccgc tcgacgatct acaagtacgt ccccgaactc agcacccacc gagcgctcgc 22320 
ctctgggccc ggtccgcagc cagcgttcca gagctaagcg ctggtttccg gcgtctgcta 223 80 
ccgggacgct tcggccgccg tgggttcagc gcaagtgagg gttcctcggc gggaacccaa 2244 0 
gggtccgtcc cgcccaggac caggactgga cccgtgggtt cggctcaacc cggagtgtcg 22500 
catttgatcg tggtttccag cagtaatgga tgctgaatct tgtgattgac gccgccgcga 22560 
ccggagagtg tcttggagca accgattccc gcgcgattcc ggaccatcca tcggacctag 22620 
gttctgttgc gacctgagcc cgccggcgcg gcggagtttc cgcctcaagt ggccaggtcc 22680 
agccattcct . cgatgggcag cacgtgtgcg cggtgtccgg cgtgcgtgcg gaactcattc 22740 
tccgggtcgg agacctcgac gtagcccgcg atcttgtccg gctgttccgc ctcgtagtgc 22800 
atcggcactg cgtaccgggc gtcgaggatc tctgcggctg cggcggcctg cctcgggtcc 22860 
atcgcggcgg gcagcgggct cggcggctgg aggtgtggcg cgtcgaccac cgcgccgttg 22 920 
gcgggcagga acaccgcgtc gaacgggctg aaccggcgcg cgatgagcca ccagtagccg 22980 
tggaacatcg tgtcgccgcc gtggaagacc cgctgcccgt cggcctgcac cacccagttc 23040 
agctgcgggt cgcccagccc gtcgacggcg gggaccgcgg tgacgcggaa cggcccgagg 23100 
tcgcgggtgg accaggcatc cacgacctcg gcagccagtc ggtgcagtgt cagctcgcgc 23160 
tcggccggca gtgtcgtcac gttgtccacg tcgtcgccgt ggccgggcgc gggtcggagc 23220 
accggtgccc ccggtgtcag cgcggccgcg agcgcggccg cgtcggtgtg gtcccggtgc 23280 
aggtgggtga ccagtgcggc ggtgaccgtc ccgctcggca ccgccagcct ctcgccgggt 23340 
ttccacgccg tgaacagcgg tgagaggtcc cgtacgtagt cgatcaccag tcgcttgccg 23400 
cccgcctcga tttccagtcc ggcccagccc agtcgtcgca cccgcatctc gcactccttc 23460 
ggtccgtggg tgaagcggaa tttagcgtac gcacgttcac taaatcaata gcgaacgcgc 23520 
gtacgctatc ctcgtcacat gtcaccgcga cgctcagcgg ccgaagcaca ggccaccagg 23580 
agccggatcc tcgaccgtgc cgccgagatc gcatccgagg aagggctgga cggcatcacc 23 640 
atcggccggc tcgccgagga actggagatg agcaagtccg gggtgcacaa gcacttcggc 237 00 
accaaggaga cactgcagat ctccacgctg gacaaggcat tcgtggactt ctggcaccgg 23760 
gtggtcgagc ccgcactggc cgagccgccg ggtctgqggc ggctgcgcgc ggtgtgcgcc 23 820 
aacgccgtgg gctacctaga agcgccactg ctgcccggcg gctgcctgat gaccgcggcg 23 880 
ctcaccgagt acgacggccg ccccggccgg gtccgcgacg cggtggccga ggtgtggtcg 23940 
cgctggcggg gacagctgcg ggcggacctg accgcggcgg tggagaacgg cgagctgccc 24 000 
gccgggttcg acatcgacca ggcgctgttc gagatcgtcg ccaccgggct. ggcactgaac 24 060 
gcggccatgc agctccagca cgaccgggcg gccgccggcc gggcccgccg cgcgatcgaa 24120 
cgggccttgg accagtcctg acgcccgcgc cagacaaacc cgcccgattc tttcccgagg 24180 
accagaccgt ctgaccgcgt tcctgctggt cagcccgtat ccatcccatt tttcggcgag 24240 
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tacgccctag catggtcacg agacattccc 
aaagctactc gccggttgct ttgcaccttg 
cccgtgggtt cagtgtgagt gaaggttcct 
ggtctgaaga cttcgtgaag aaggtatgaa 
atgggaagcc aaggcttggg agcgtggaca 
ccaccatgat cttcaacctg gtagccccgg 
tgcggtacga cagccgcaat ccgtacgaga 
gtcaggtgga ctgggtgatc gcccgcgacc 
gcgaaggcga tgtgcggatc ggccctcgac 
tgagctcgcc gtcggggcag gcctccttcg 
tgaacgacac ctacgacgtg gtcgaacctg 
aggtgctgag ccagctgctc tcgcacaacc 
acgccaaagc gctggctgcg ggacctggcg 
gctccagctg gaccacgtcg gtgccgtgcg 
ttctccaggc gcgccatcgg cgcaggaagc 
tgtcatggcc acggacagct tcgattcctc 
cgcgtcgtgc gtctcggaag tcagccgaga 
gcagttgcga gccgatgaga tcgtcctcag 
cgccacattg ccgcacgttc tcgaattcga 
tgcggtgttg gagtatccgc gacgactggc 
ccaaccgcat cgggcacttc cgcttcctgt 
gcgccgacga ggccgtatcc gcagcgacca 
acatcgaccg cgtcacggag cagatcgccg 
tcctggcgac gggacggccg tgaggtctct 
aggcacatga ggtgttgcgc gcggtcgttt 
cgggcaattg agggagcgaa gatttagagg 
gctcgggagc ggctgtgggg cggccaggcg 
accggtacgg cggcgaggcc ttcgagcagg 
gtagtgcggc cgagtgcgat catttggaac 
gggccatctg gatagacgcc gtcggggccg 
gagctgacat ccgccatccg tacatggggc 
atggcgactt gttgctcggt gttcagaatt 
ggttgatgca acagcgccac gtcggcccgg 
tcgcagagca gcaggtcgac ggccacggct 
ttcgccagca gctcaccgga ggcgccggcc 
gtcgcggcac gctgggtgcg tcgctcggct 
tcggtcagca gcattgcccc ggcttcggtg 
aacacgactc cgagtcgttg ctcgagctgg 
atgcccaggc gctgggcggc ccggccgaag 
cgcaactccc gcgtctccat ccgtcgagcc 
tgtgagacct agatggtgtt ggttccccgc 
gaacagacga ttgcactggt caccggcgca 
gggctcggcg cgctggggtg gagcgtcgga 
gatgccgtgg cgaaattgcg tgcggacggc 
acagacgacg cgagcgtcgc ggctgctgcg 
gatgtgctgg ttaataacgc cggcatcgcc 
acaccggcga gcctccgggc ggtggtggag 
aacgctatgc tgccgttgct acgccgctcc 



ggagccgtca ttgattgact ggcgcagcga 24300 
ggtcagcttc gggagttccg ggcccagagt 24 360 
cgctgggagc ccgaggggga ctgccaccag 24420 
cccgggttgg gcatgcttcg ttatgagggt 24480 
tcatggcaaa cgatcacggc acggtgctcg 24540 
cgggtgtgat cgcgccagtc ggcgtggagc 24 600 
tctccatgaa gctcaacgta ggcacggacg 24660 
tgctggccga cgggctgatc gccgaggcag 24720 
ggggttttcc ggggttggtc gtgatcgaga 24780 
aggtgaatgc tgaccagctt gcggacttct 24840 
gtgatgaaca ccggtggatg aacgtcgacg 24900 
tgtaatggcc cagctctccc gaagcgccgc 24 960 
gcgctgaaca ccgccacgct gtgtctccga 25020 
cccggctcgg tcaggccgaa ggtgctgatc 25080 
gctgcttctg ctcccgccgc agtaccgtcg 25140 
gaagctacag gcggccgtgg catcgagcgt 25200 
cgtctacacg cacctgatta ccgaggctcc 25260 
cattctacgg acgagtgttg aggaaaatat 25320 
gattccgttg ggatattcgc cgggtcctgc 253 80 
gaaacatttc catcaacgcg ctgatcaggg 25440 
agtgatgcct cgacgagatc cgccgccaat 25500 
cgcaacgaat gctcgcaacc agcttcggct 25560 
aaacctacca gctcgaacgg gaccgctggc 25620 
gcggcatccg catagcgtct tctcccgctg 25680 
ccggcagtcg cacggcattc gtcctagctg 25740 
agtgtggcca cgcggaccaa gccggcgagt 25800 
atgactgtcg tcacgtccgg cgcgtctaga 25860 
ttgacgcgac tggattcggg catgaccacg 25920 
agttgcgtct ggttgcgtac ttccacgccg 25980 
ggccagcgcg caagcgggag atccggcagt 26040 
tcgctggcaa gcggatgcga ggtcggaaga 26100 
tcgatgtcga gttcggccgt cgggtcgaag 26160 
ccgtcatgca gcgttttctg gggctgggat 26220 
cccggctcgg cggcgtacgc gtcgagcaac 26280 
ttggcagcca ggactagcga gggctggctc 26340 
gctgccagcg cgccgaggat cgcccggcct 26400 
agcgagactt tgcggctggt gcgttgcagc 26460 
gcgatcgtcc gcgacagcgg cggctgggcg 26520 
tgcaactcct cggcgactgc aacgaagtac 26580 
taccgctgat tcatatcagc tgggtatcgg 26640 
cggtttcggg ccacgctaga aagcatgagc 26700 
aacaagggaa tcggatacga gatcgcggcc 26760 
atcggggcac gggaccacca gcgcggggag 2 6820 
gtcgatgcgt tcgcggtatc cctggacgtg 2 6880 
gctctgctcg aggagcgcgc cggccggctc 26940 
ggggcatggc cggaggagcc ctcgaccgtc 27000 
accaacgtga tcggcgtcgt tcgggttacc 27060 
gagcgcccgc ggatcgtcaa ccagtccagc 27120 
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cacgtcgctt ccctgacctt gcaaaccacg ccgggcgtcg acctcggcgg gatcagcgga 27180 
gcctactcac cgtcgaagac gttcctcaac gcgatcacca tccagtacgc caaggaactc 27240 
agcgatacca acatcaaaat caacaacgcc tgccccggct acgtcgcgac cgaccttaac 27300 
ggcttccacg gaaccagcac gccggcagac ggtgccagga tcgccattcg gctcgccacg 273 60 
ctgccagacg acggcccgac cggaggcatg ttcgacgacg ccgggaatgt gccctggtga 27420 
ggcgct'cagt cggcgatggt gcaatcgaag tcggagaggc tcgctgcgac cgggtacgcc 27480 
gaacaacacc tgttcctgtg ggtacggatg tcggccttcg ccgtctcggt cattgacaac 27540 
ctgtacttcg ggcgccgtta ccgccggtgc gccgcggttg cctggcgaca ctgggccagc 27600 
cgtggctcac cggcggctta ggtcaggcgt gggcggttgc cagcatggcg ggtgcggctt 27660 
tgcgtaggtc gggtaggcgc atccggcgcg ggagccggtc gagttcttcg ccgatggccg 27720 
gtgctttggg gctgctcagg agccgaacac ctcccagccg caggtgccgg gctgaaccga 27780 
gtggttctcg tcggctcgga tcacaacgtc tgccggaaca gctgcggcga ggtggtcgca 27840 
gattcgaggc gggatcgtcc tcggcgacct tgccgacgat cgcggctagg gcccagggct 27 900 
tcgtcgacct ggttggcacc tagatcacga cggtcaaaac ttgccggcat cagagacgat 27960 
cgaagtgatc ccgggtcacg tcggcttatc ggtcgagtga gtcccggggc ctgcccagcc 28020 
aggtcttgcg tcgttgttcc gggctcagtt gcggattccg acgaacaggc ctcggccgtt 2 8080 
cggtgctcca ggaaggtatt ccgcgcggat ccctgcgtct tcgagcgcgg cggtgtactc 28140 
gtcctcagtg aacagcgaga ggatttcgaa ctctgtgaag tcccggatcc cggtgggttc 28200 
ggcgactgtg tagcggacgg tcatccggct cgtacggccc tccaggaccg agtgcgatag 28260 
ccggctgatc acccgctcgc cgtggtgcgc gacggctccg gtgacgaacc cgtcgatgaa 28320 
cttgtcggga aaccaccagg gttcgatgac cgcgactcca ccaggggcca ggtgccgggc 28380 
catgttccgc gtcacgcgtc gcaggtcgtc aacggtccgc atgtaagccg cggtaaagca 28440 
caggcaggtg atgacgtcga atggctcgcc gaggtcgaaa tcgcggatgt caccgatgtg 2 85.00 
aatcggtacc tcagggactc gtctgatcgc gatctcccgc atcgcatcgg acagttcaag 2 8560 
ccccgcgacc ttcgcgtatt cggcacggaa tcgctctagg tgcgccccgg tcccacaggc 2 8620 
gacgtcgagt agggactgtg cttcgggcag cctggtgcgt acgagctgga ctacttcccc 2 8680 
ggcctcggct gcccagtccc ggccacgcgc ggagtggatc gcgtcgtaga tgtcggcatg 2 8740 
atctgggctg tataccgagg aggtttctgc gaatgtgtcg ctcacgcgcg acatcctcac 2 8800 
tttcggagtg gtgatctttg gctgatgtgg tgttcgacgg ccttctggaa ctcgtcagcc 28860 
accgtgcgca cctcggcgtc gtcaaggctt gggtgcagtg gtagcaggag tgttctgcgg 2 8920 
caggcgtgct ccgcagaagg cagcttgcag tccgcgcggt agatggggac cttgtgcagg 2 8980 
ggcgggtagc ggtagctcgt gtagatgccg cgttccagca tttgctgcgc cacctggtcg 29040 
cggatctccg gagccagctg gacccagtag aagtagtgtg acgagacgtg cccatccggt 29100 
agcgtcggcg gtaggaggac acccggcaca tcggaaagca accggtcgta ctgcgtagcg 29160 
atttctctac gcctgttgat gaattctggc agtttgcgca gctgcacgct gccaagcgct 29220 
gccgtcatgt cgttcccgat cagccgctgg ccgatgtctt cgacgcgaat atcccaccag 29280 
cggttggaag acttggccga atcgaatccg ctcatctgct caagaccgtg gtaggcgagt 29340 
cgtcttgcgc ggtgcgccag ctccggatcc gccgcgtaga acatgccccc atccccggtg 294 00 
accaggatct tcatcgcatc gaaactccac gtggccaggt caccaaaggt tccgcaagcg 29460 
gtgccgtgca cggacgatgc caccgcgcag gcggagtcct cgatgagcat gaggcccttt 29520 
tcacggcaga aatcggcgat cgcggtgact tctcccggcg atcctccata gtggagcagc 29580 
aatacggcct tggtcgccgg cgtgatggcc ctcgccacat catccagcgt ggggttcaac 29640 
gtccgggggt cgacgtcgca gaacaccggg cgggcaccgg aggatgcgat ggcgttggcc 29700 
gccgccacga agcttatcga aggaagtacc acgtcgtcgc ctgggccgag gtcgagcacc 29760 
tgcacggtaa ggaacagcgc ggcagtcccc gagttgagga acacgacctg ttcgggatcc 29820 
actcccaggt ggtgggcgaa ttcggcctcg aacgtccggg tgcgcggccc gagcccgatc 2 9880 
cagttggagg cgaacacctc cgcgatcgcg tcgagttctt cggtgccgag gatcggctgg 29940 
tgcaggttga tcacgttgct gaaatcctcc gagatgccgc catgctggat gctaggaact 30000 
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cttggccacg aattcagcga ttgattcgac 
tgggtagacg ccgacccaga aggttcggtc 
ggcgatccgg taccgcacct gctcgaaggc 
cagtcgggtg ccgatgttgc gggattccag 
ggcgtccgca 'Ctgatggtga tcgcaaaccc 
taccggcagc agcaggcccg gcaacccgga 
gcggcgtgcc gacccgaatg cggaaatctt 
caggtcggtg gtcttgaggt tgtaaccgac 
cggtggaagg gtaccgaggt ggtagtcgaa 
ctcgcaccag caatcccgtc cccagtcacg 
gctgccggtc aacacgcagc caccctcgcc 
gaccgttgtc aggtcgccga aggttccggt 
atcacagttg tcttcgacga ggaacagctc 
agcggcgaag gggttgccca gggtgtgcgc 
ggcggccttg atgcggtctg gcgttgcgtt 
cgggacgagt ccgttttgga ccgccggatt 
gatcacttcg tcgccgggcc gcagtcgtgc 
cagtgccagg agattggccg acgaaccgga 
gaagcgggcg aactcgctct cgaatcgccg 
cagcgcggct tccaccagtg ccacccggtc 
gatcggcgtc gatccagcca cgaaggtcgg 
ggatgccaat atccggtcct tggcatccgg 
gtcacacgaa gtcactctgg cgcgcccttt 
gcaggcgacg atcagtcttc gcgccttgcc 
cgcgttatga cgtcccagcg ggacagtgtg 
ggttcgatgc ggtgcgggac atcaggacag 
aaatcctcga tctacccggc acacccgact 
gctcgcgctt gtcaagatcg ctaccatcac 
cggcgtttcc cgggggatat cagagatttc 
aattgcgatc ggagagttca tgcgtgtcct 
cttcaacctg gtgccgttgg cgtgggcgtt 
catctgcccg aatatggtgt cgatggtcac 
cgacgagctc gacctcatct ccttggcggc 
ctcgttcgac gagaaggggc ggcatccgga 
cggcagagac acggacgccg tggagcaact 
tctcatgggg ttcgccgaga aatggcagcc 
ttcggggcca gttgtggcgc gagcgctcgg 
cgatgtgtcg gggtggctgc ggtccggttt 
gcagcgcgtc gacccgctcg ggacgtggct 
gttcgatgaa gagatcgtga cgggccaagc 
cctgcctgtg gacttggact acatctcgat 
gttgccggag tggttgcgcg aacgaccgac 
gaccaagcgg cggctgagca gggtgaccga 
agcaatggtg gaaaggttgt tgcgcggcgc 
cttgtctgac gacgaagtac gggagatggg 
atacgtaccg ctcaacgaac tgctggagtc 
gacgacgcag gaaaccgcca cggtcaacgg 
ctgggacgaa tctcgtaggg cggagctcct 



gacgtagtcg atcatttggt ccgttatgcc 30060 
ggtgacgatg tcgctgttgg tgagcgcgtc 30120 
cgggtgccgg gtgatgttac cgccgaacag 30180 
gaagttcacc agggcggcac gggtgaaccc 30240 
gaaccagctc gggtcgctgt gcggtgtggc 30300 
cagcccttcg cgcaaccgtc gccagttacg 30360 
gctcaactgg ctcagcgcaa gtgcggcctg 3 0420 
gtgggagaac gtgtacttgt ggtcgtagcc 30480 
cctcttgcgg caggtgttgt ccacgccggg 3 054 0 
cagcgactcg atgatgcgag ccaattccag 30600 
gctggtgatg tgatgggcag gatagaagct 3 0660 
cagccgtccc cggtaggtgg atcccaccgc 3 0720 
gtgttctttt gcgatctccg cgatttcgtc 30780 
cagcatgatg gctcgcgtcc gttccgtgac 3 084 0 
gtaggtgccc agttccacgt cgacgaatac 30900 
gatcgtcgtg gggaagccga ccgccgcagt 3 0960 
ctcgccgagt ttgggggagg taagcgaact 3102 0 
gttgacgaga tgagccttgc ggaggccgaa 31080 
tgcattcccg cccgcggcga tccggagctc 3114 0 
gtcctcgtcg agcacggcgc ccgatggccg 31200 
ggattcctgt tcgcggtggt aatcgcgtac 31260 
caccatctca gtagcggtag cgcaagtgtc 31320 
ccccagcgct ctggttttcc ggctctgcat 31380 
ttcaggagat gagcgatgcc cgtggcgaat 31440 
ctgtctcggc gccttacacc ttcctgccct 31500 
cggagcaagg agaagcgctc attgactcag 31560 
cggtagagcc caggctagcg ggaacgacct 31620 
ctggaaggcc taagatttgg cttgcgaaag 31680 
tgtgattctt ggcatgcttc ccgggtgttc 31740 
gttcaccccg ctgccggcga gttcgcactt 31800 
gcgtgccgcg gggcacgagg tccgtgtcgc 31860 
cggagcagga ctcaccgcgg ttcccgtcgg 31920 
caagaacgaa ctcgttctcg gcagcggggt 31980 
actcttcgac gagctgctgt caatcaactc 3204 0 
ccaccttgtg gatgaccgat cgctggacga 32100 
tgatctcgtt gtgtgggacg ctatggtgtg 32160 
cgcacgacac gtgcggatgc tcgtcgccct 32220 
cctcgaatac caggaatcga agccgcctga 32280 
gggagcgaag ctcgccaagt tcggagccac 32340 
gaccatagat ccgattccat cctggatgcg 32400 
gcgtttcgtg ccgtacaacg gtccggcggt 32460 
gaagccgcgc gtctgcatca cgcgcgggct 32520 
acagtacggg gagcaaagtg accaggaaca 32580 
ggccaggctc gacgtcgagg tgatcgccac 32640 
ggagttgccc tcgaacgtcc gggtccacga 32700 
gtgttcagtg atcatccatc atggctcgac 32 760 
c.gtaccgcag ttgattctcc ctgggacctt 32820 
agccgatcgg ggagccggtc tggtcctcga 32880 
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ccccgcgacg tttaccgaag acgacgtgcg aggtcagctg gcccgcctgc tcgacgagcc 32 940 
gtcgttcgct gccaacgcgg cgctgatccg ccgtgaaatc gaggaaagtc ccagcccgca 33000 
cgacatcgtt ccacgtctgg aaaagctagt tgccgaacgt gagaaccgcc gcactgggca 33060 
gtctgatggc catccgtgag caacgtgtgg ccggaaacat ggacgccggg gtttggcagg 33120 
tgttcatcgc tgttgcgtcg actcggattc cgccgtgacc gggacgatgc caggcgagtc 33180. 
ccgaagtcag attcttgtcc agaatcgtcc aatggggtgt tgatctcccc agaggtttgc 33240 
gctccaaccg atttccgacg aggatcgtgg cgcccgctga .gcaacgacta ccgtgcggtc 33300 
gagacatacc gctgtgcgcc aggagcgaag gtgggttgcc cgatcaccgt gctggtggta 33360 
gatgccgagc cgaaggtcac cttggatgag gcggaagcct ggcgagagca caccgaggcc 33420 
gtggccgacg tccgtgtctt ctccggcggg catttcttca tgaccgaacg ccaggacgag 33480 
gtgctcgcgg tccttacggg cggatcgctt cgatgatcct cgccaggccg ctggaccaga 3354 0 
ccgcgacgcc cctgggagcc ggcgtgcaca tcgtcacggc agtgagggat tgggcatgag 33600 
cagttctgtc gaagctgagg caagtgctgc tgcgccgctc ggcagcaaca acacgcggcg 33660 
gttcgtcgac tctgcgctga gcgcttgcaa tggcatgatt ccgaccacgg agttccactg 33720 
ctggctcgcc gatcggctgg gcgagaacag cttcgagacc aatcgcatcc cgttcgaccg 33780 
cctgtcgaaa tggaaattcg atgccagcac ggagaacctg gttcatgccg acggtaggtt 3384 0 
cttcacggta gaaggcctgc aggtcgagac caactatggc gcggcaccca gctggcacca 33900 
gccgatcatc aaccaggctg aagtaggtat cctcggcatt ctcgtcaagg agatcgacgg 33 960 
cgtgctgcac tgcctcatgt cagcaaagat ggaaccgggc aacgtcaacg tcctgcagct 34 020 
ctcgccgacg gttcaggcaa ctcggagcaa ctacacgcag gcacaccgtg gcagcgttcc 34080 
gccctatgtg gactacttcc tcgggcgggg ccgcggccgc gtgctggtag acgtgctcca 3414 0 
gtctgaacag gggtcctggt tctaccggaa gcgcaaccgg aacatggtgg tggaagtcca 34200 
ggaggaagtg ccagtcctgc cagacttctg ctggttgacg ctcggccagg tgctggctct 34260 
ccttcgtcag gacaacatcg tcaacatgga cacccggacg gtgctgtctt gcatcccgtt 34320 
ccacgattcc gccaccggac ccgaactagc cgcctcggag gagcccttcc gacaggcggt 34380 
ggccaggtcg ctctcgcacg gcatcgattc gtcgagtatc tccgaggcgg tcggttggtt 34440 
cgaggaagcc aaggcccgct accgcttgcg ggcaacgcgc gttccgctga gcagggtcga 34500 
caagtggtat cgcaccgata ccgagatcgc ccaccaggac ggcaagtact tcgcggtgat 34560 
cgcggtgtcg gtgtccgcga ccaatcgtga ggtcgccagc tggacgcagc cgatgatcga 34620 
accgcgagaa caaggtgaga tcgcactgtt ggtcaagcgg atcggcggag tgctgcacgg 3468 0 
tttggtccac gctcgggtgg aggctgggta taagtggact gcggaaatcg ctcccacggt 34740 
ccagtgcagt gtggccaact accaaagcac cccgtcgaac gactggccgc cgttcttgga 34800 
cgacgtgctc accgccgatc ccgaaaccgt gcggtacgaa tcgatcctgt ccgaagaagg 34860 
cggtcggttc taccaggcgc agaacaggta ccggatcatc gaggtgcatg aggacttcgc 34920 
ggcacgacct cccagcgact tccggtggat gactttggga cagttgggcg agctgctccg 34980 
gagcacccac ttcttgaaca tccaggcgcg cagcttggtc gcctccctgc atagcttgtg 35040 
ggcgttgggg cgatgaccag ctcgatgcga aagccggtgc gcatcggtgt gctcgggtgc 35100 
gcttccttcg cgtggcgacg gatgctgccc gcgatgtgcg acgtggccga aacagaggtg 35160 
gtggcggtgg cgagccgtga tccggcgaaa gccgaacggt tcgcagcgcg attcgaatgc 35220 
gaggcggtgc tgggttacca gcggctcctg gagcggccgg acatcgatgc cgtctacgtg 35280 
ccgttgccgc ctggcatgca tgcagagtgg atcggcaagg cgcttgaggc agacaaacac 35340 
gtgcttgcgg agaaaccgct gacgacgacg gcgtccgaca ccgctcgcct ggtcgggctg 354 00 
gccaggagga agaacctgct gctgcgggag aattacctgt tcctccacca cggccggcac 35460 
gacgtggtcc gcgacctgct gcaatccggg gagatcggtg agctccggga gttcaccgcc 3552 0 ' 
gtgttcggaa ttccgccgct tcccgacacg gacatccgct atcgcaccga actcggtggc 35580 
ggagcgttgc tggacatcgg tgtctatccc gcccgtgccg ctcggcactt tctcctcggt 3564 0 
ccgctcacgg ttctcggcgc aagctcgcac gaggcccagg agtcgggcgt cgacttgtcg 357 00 
ggcagcgtgc tgctccaatc ggaaggtggc accgttgccc acctcggata cggtttcgtg 357 60 
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caccactacc gcagcgcgta cgagctgtgg 
gcgttcacgc cgcccgccga gtggcaggcc 
gacgagttgt ccttgccagc ggaagatcag 
gacatcagag cagggacagg cgtggacgac 
atgatccagc aggccgcgct ggtggaggcg 
tagccgcccg gcatccgcgg gtagtagttc 
cagcggggaa gccgctggag aggctcaccg 
ctgatcgcgg acggctcgga gaagtgctcg 
gccagggcgg cgggaaagtg agccaatccg 
gccgcgcggt agctgtcccg gagtcgctcg 
gtgctgtggt agacgagcgt ggggaccacc 
gccagtcgtg cgcgccatcg aggttgctcg 
agctcgcggc cgaggaagcg ggtgacgagc 
acggcggtac gacccgcttc ggtgagaagc 
gcatgcccga ccaggccggc cagcaccggc 
atgtccagca gatcgatcgg caccgcacct 
gcgtaggtgg ggtgcagttc gccggcgaag 
cagatgccga ccgggcagtc ggtcgtcagg 
gcggcatcga gcatggcgtg tccctgcgcc 
gtaccgaggc cttcgtagtc ggtgatgacc 
atcagctcgg tctccggctc ggttccggtt 
ccgaggccgt gggtgcccac tgcgaaagtg 
atgttcggca ccagaacggt gccggagacg 
cggtagagga tttgccaggc cttggctgcg 
acgggccggg ccctgaggag cgtgcccggg 
tagaagggat catccgcggg tgcccgcaga 
gccatcagga ctgcttcttt cgagcctgca 
tggcgtaatc cggatgtttc cggtattccg 
gtccgtgctt cgccgcgatg tctcccaagt 
gctcgacgcc ggcgagctca tcgagtttcc 
tgcacctggt ctgtgccgtg aggtcggtga 
ccagcgccgt gaagctgttc aaggtatggg 
cttccgggag tcgaacccga gtgatgatgt 
ggattcggtg gacctcgcgg atcgcggcgg 
tcgcccatgc ggcctgaaag aaaccgtccg 
cgaactcaag actgccggcc agtccggtct 
gctgggcgtt cacggtgatt ccggtgactc 
cgggctgccc attgccgcag cccaggtcga 
tgtcgatgaa caggtcggtc agttggtcgg 
cctcccgata cccgcccgcc cagtaaccgt 
cgaagatgga ctccacctga tccgcggtcg 
tcactcgtcc tccgcgctgt tcacgtcggc 
cacccaagca ggaacgccgc cttcggcgtt 
gtaggtgtgg aggccgacca aattgcgttc 
cggctgctcc tcgtgctgaa cgccttccaa 
gggtgcggcc gggcactgcg cctgcccgcc 
ctcgatcacg tatacgccac cgctgcgcaa 
cacgtgtccg ttgatgtggc tgccatcgtc 



gggagtcgtg ggcgaatcgt cgtcgaccgg 35820 
gtgatccgaa tcgagcggaa gggcgttgtc 35880 
gttcgcaagg cggtcaccgc cttcgcacgc 35940 
cctgcggtgg ccggagattc gggcgaatcg 36000 
atcggtcagg cccgtcggtg cgggtccaca 36060 
gcctcgaagc ctgaccgggc atccggaagc 36120 
ccatccgctc acctggcatc tcgcggaccg 36180 
tcgaaccacg agacgaccac tcgcgagctg 36240 
gagagcggat gccaccgcac tggcgtaccc 36300 
ccgaatgcga acggaacgat ctcgtcgtcc 36360 
gggccaccgt tcctacctgc gacgctttcg 36420 
aaaaggccgg aagtgtcgag gaagtcgctc 364 80 
tccggtgcac cgagctcgcg cacttgatca 36540 
tcgtcgaatg gcagatcggg gtaggcagcg 36600 
ccggtgaaca ccccgtcatt tcggtggatg 36660 
gcggccgcag cgcggattcg cagttcaggt 36720 
gccgacgctt gcccaccctg cgcatagccc 36780 
ccggagcccg gtagccgttg cgcagcgcgg 36840 
ctgccgacgg tgtaggtgtg ggttccagga 36900 
acggcccacc cgcggtcgag ggccacggcg 36960 
cgaagcaggt acgacggggc aacttggcta 37020 
atgatggggc gatcttcgcg cggccacggg 37080 
gcgttcggca tgccaagggc ggagttggac 37140 
acgggttcgc ccgtgccgcg cagtgccgag 37200 
acacccggcg gtagcggcgt cggcggtcgg 372 60 
tcgtcgccga ccaggctggc gtgctcggag 37320 
ggagcatgaa acccatgctt tcctcgtttc 37380 
caaccgcggc gatcagctgt gctggtcccg 37440 
agcgttgctg gtaggtgccg acagccgcag 37500 
ggagcaactc gtcgacgtac caggagacca 37560 
cttcgagaat ctcgaacccg gcttcgctga 37620 
cggtcgtgcc cgtccaaacc gccgcgtact 37680 
ctccgaggac gaaccggccg ccgggttcca 37740 
cctggtccac gatctgcacg acggactgca 37800 
ggtagggcag ctgggcgccg tcgactagat 37860 
cgttggcgag cctggtggcg gcggcgagat 37920 
gaacgccgct ggcgcatgcc gcacggacta 37980 
acaggtgcgc tccgggacgg agcgcggcct 38040 
cagcatccga ccacggtgtg gcaccggcat 38100 
ggtgcagggg acgcccgtgc gccaacgcat 38160 
gaaatgcctg tgtgttcgcc cctctgctgt 38220 
caggtgcaat atgtcgtcca gactccttgg 38280 
gacgcctttc tccaggaacg cgatgttgtg 38340 
caggtagctc ggctcgtacg agcccgcatg 38400 
caggttcttg agcaggctga ccgtggtgcc 38460 
gaatccggga gcataggtcg tccacagatc 38520 
ccgggggaac agcgtttcca gggatgtgcg 38580 
gatgatgatg tcgaacggtc cgtacttgtc 38640 
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gtcaacggcg gccagctcct cgggcttgct ctggtcggcg cggacggtgc agagcctctg 38700 
ctggtcgagg aaggacttgt cgaaaacgtc catcccgaac acgaggccgc ggtggaagta 38760 
gcgcttccac atcttcaggg attcgccgcc gccaccgtcg aagttgtagc caccgacacc 38820 
gatctccagg atgcgcaccg ggcgatcacg gaactcgccg aggtgtcgct cgtatagcgg 38880 
ggtgaaccag tgcaggccgc cccacttgtc cgtgcggtag tgggaggc^a gcaagttgag 38940 
gtcgggacgt cggtgcccgc agccggcgac cactgcggag atggcctgga agccatcgga 39000 
cagttccgac ggaccgggta tcgaaccgga tgtggtggtt cggaggaagt tggtgctccg 39060 
ggcgccgacg gccctgggag ctcctgggcc gaacaactcg gcgatgagat cggtgagctc 39120 
gtaaccgatc cgcagcggga cgtctccgac cggtcgttgc tcggccttga tcagctcacc 39180 
ggactgtagc gtcaggacga agtcaacggt ctcgcctcgg tgggtgatct ggaccgcgac 3924 0 
ctcggtccgt tcgatgtcgg gggccggttc cgcgcggaag aggatctcgt cgatcagcac 3 9300 
gggtgcgatc ctggcgagtc cgagttcggt ggtcaggtcg gccaggctcg ccgcactgga 39360 
tccggcggcg aggatgatgc gttccacggt ttcgatctcg tgcgttgtgg acatcgtgat 39420 
gagctcctca tggctgaccg ggtgaaagcc gtgccggcgg tttgatcgac aggccgtgct 39480 
ggaagatgtt ctgcggatcc caccgcgctt tggcccgctg cagccgcggg tagttgtctt 3 9540 
tgtagtacag gtcgtgccag gcaacaccgg aggtgttcca caatggatcg gccaagtcgg 39600 
tgtccgggta gttgatgtag gagccgtcga cacgggtacc tggcaccgga actccgccgg 39660 
tttcggcgta catctcgcgg tagaaaccgc gaatccaggt cagatgccgc tcgtcctcgg 3 9720 
cgggctccga ccagttcgtg acgaacagcg ctttgagaac cgagtcgcgc tgagcgagtg 39780 
cggtggccga cggagccacg gcattcgcca taccgccgta accgagcagc aacagcgccg 39840 
ccgcagggtt gtcgtatccg tagacggtca gccgccggta aaccgtggct agttgagctt 39900 
cggacagccc ggtgcgcaag taggcggctt tgaccttggt ccgttgcatg cccggttcgc 39960 
cgccttcggc gatcgccccg gccacctggg tcgatcgcaa ccacggcagg gtttcccgca 4 0020 
gcccttcggc tggagtcacg ccgacctggg cgttgatcgc cgacaggtgt tcggccaggg 4 0080 
tgcgttccgc gttcggatcc gtgccgtcca ggtgaacgtt cagcgtgacg tagccagctt 4 0140 
gccggtgtgc gcagacgagc gtgctgaaca acccgagttg cgtggattca ggcgcgctgt 4 0200 
gctgctcgta ccaattgccg aagttctgta ggagcacggc gaatgactgc tctgtcagtt 40260 
cgtgccacgg ccagtggaac gatcggagca gcactgtcgc gggcggccgt ggcaggagct 4 0320 
ctgcggcgtc ggtgctgacc acgtccggcg ttcggagcca aaacctggtg acgatcccga 4 0380 
agttgccgcc accgccaccg gtgtgcgccc accacaagtc gtgaccggcg cccgtggagt 40440 
tccggtcggc ctcgacgatg tgcacttcac cggcctggtc gaccacgacg acctcgacgc 40500 
cttgaaggta gtcgacgacc gaaccgaatc ggcgcgacag cgggccgtat cccccgccga 4 0560 
ggatgtgccc gcctgcgccc accccgggac atgcgccggt cgggatcgtc acgccccagt 40620 
tcttgaacag ggttcggtac acctgcccga gggcggcgcc cgcctcgatc gcgaatgccc 40680 
cgcgcgtgct gtcgtagtac acgcggttga gctcggagag gtcgacgagc actcggatcg 40740 
ccgggtccgc aacgagattc tcgaagcagt gcccgccgct gcggacccct acccgcctgc 40800 
cggtgcgcac ggcgtcggcg acggcgtgca cgacgtcttc ggcggagctg gcgatgtgga 4 0 860 
tgcgttcggg ttttccggtg aaacgggggt tgtgcccgac gacgaggtcc ggataacgag 40920 
gatcgtcggg ctcgacggtg atctctgttc ctggggttcg acgattcatg ggtgccgggt 40980 
catggaattc gggcaccgcc cctccttttc tgactggtcc actttgttcg cccgcagccg 41040 
agatcatcta cgcgtccggg tgattatctg tgtgtttcag ctcatacgtg aaacccggtc 41100 
gcctccgccg gctctacttt gtggatcgat atcgcggtgc gcatggtgcc gtatgcgctg 41160 
gaaccgaaaa ggtgatgact ttccatgagt gagatcgcag ttgccccctg gtcggtggtg 41220 
gagcgtttgc tgctcgcggc gggtgcgggc ccggcgaagc tccaggaagc agtgcaggtg 41280 
gccggactgg acgcggtggc cgacgccatc gtcgacgaac tcgtcgtacg ctgcgatccg 41340 
ctgtcgttgg acgagtcggt gcgaatcggc ctggagatca cttctggcgc tcagctggtc 41400 
cggagaaccg ttgagctcga tcacgcaggc ctgcggctcg cggcggtcgc cgaagcagct 41460 
gctgttctcc ggttcgacgc ggtggatctg ctggaagggc tcttcggccc ggttgacggc 41520 
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aggcggcaca acagccgtga agtccgctgg 
cagggcctcg ccggcgcgca gcgcctgctg 
cacgccgtgc tggccgcagc cgccaccagg 
tacggatccg acaaatgggc ggacctgcac 
tcccgattcc aggatgcccc ggtgcgagtg 
cccgaactcg gtggtgcttc gctgcgcatg 
tacgggctgg acattttcga gaaagccggg 
ggtgaccaga gcgatgcgga attcctggaa 
attgtcatcg acgacggcag ccatgtcaac 
tttccgcacg tccgcccagg tggtttgtac 
cccggctacg gcggtcgcga tggggaaccc 
aaagaactga tcgacggcct gcattatcag 
tcctacacgg aacggaacgt ggcggccctg 
aaagggctca acgctgagcc tgccgcgccg 
gaggacggct gagccgttca ccagctgcgg 
tggatgggtt ccgtgatccc gagttccgcg 
ggcagaaaat agtcgtcgat gatgacgaat 
cgcaatgagt ccattgtgga ttcgtagagg 
tggatggggg cggtgggcaa ggtgtcccgg 
tcgagcagcc cgtagcgggc gaagttctgc 
tcgttgtact cgtgcagcgc catagcctgg 
ggcattccct ggaaggaatc cactacccag 
agaaccgcgc gcatgaagat gcatgcgccg 
ccgggaacac cgtctgcgag cacggcttcc 
agaccgatca tcgtgtgcgc gacagttggc 
ctgtagtcgg tgttgtcctg ccaggcgttc 
gtgagtacct tcttgagcag gtccaggtac 
gatccagctg ttctcgggtg actagttcat 
gtccgcacca gcgcggcggg atggggcatg 
gcagcagcga agccggtgtc gccgagcacc 
tcggactggg cctgctcatc cggcaagcag 
gcgtagtcga actggtcgaa gtactgggga 
gcggtgaatg ccgttcccga gccgcccgcg 
aggttgagcg ggaccgattc ggcgatccgg 
cggtgctcgg ggggaacggc gatcacggcc 
actgcgcgca gcagcggagc cggcccggtg 
cagacccgcc gtgctgaggt gcgcgccgcg 
tacggcacgt actggaccgg tgcgccttgc 
cagggatcga ggatgagctc gggagtgggc 
gggtcaagca actcgtgggc tcgatcgctg 
cggtgcagca cgaccggcag gtcgagcaat 
acgtcgacca acagcactga cggtcgccag 
tgatcgagcg agctttgcgc gacattggac 
tcggtgtcgc gctgaccgaa cgccggattg 
tcgccggtcc tgtcgttccc gcggatcccg 
gcctgcagct ctggtggtgc ggcgatcagg 
cacagcggca ccattgccat gagatgcgtc 
atacttcgga ccccagtctc tttcccccga 



tcggacagca tgacgcagtt ctcgcccgac 41580 
gcgttccgga acagggtgtc caccgcggtg 4164 0 
cgcgcggacc tcggtgcgct ggcagtccgc 41700 
tggtacaccg aacactacga gcaccacttc 41760 
ttggaaatag gaatcggtgg ttatcacgca 41820 
tggcagcggt acttccggcg aggtctcgtt 41880 
aacgaagggc accgagtgcg aaagctgcga 4194 0 
gacatggcgg ggaagatcgg gccgttcgac 42000 
gaccacgtca agaaatcctt ccaatccctg 42060 
gtcatcgagg atctccagac ggcgtactgg 42120 
gcggcccagc gcacctcgat cgacatgctc 42180 
gagcgcgaat cgcggtgcgg gaccgagccc 4224 0 
cacttctacc acaacctggt attcgtggag 42300 
gggttcgtgc cccggcaagc gctcggcgtc 42360 
cgccagtagg cgcccgtgcc gtcgatgtcg 42420 
cggaacccct tcaccgcgtc ctggcaggac 42480 
ccgcccggcg agagcttcgg gtacaggttc 42540 
tcgccgtcga gtcgtagcac ggcgagttcc 42600 
aaccagccgg ggaggaacct gacctgttcg 42660 
cggacggtct caagcgatac gccaagcacg 42720 
tccgcttggt ggtcttgcgc agagctttcc 42780 
acggtacgtc cggtatctcc gaatgcctgg 4284 0 
ccccgccaga caccggtctc ggcgaaatcc 42900 
acgcagtgct ggaggttgtc cagccgctcc 42960 
cagtccgtgc ctttggcccg agcggcctgc 4302 0 
ggatgcggcc gatcactgta aatcgtgttg 43080 
agcgcgttct gggagggcat cggttctccg 4314 0 
caggcacgga tggccgcagt gttctccagc 43200 
gccgtgatct cgtcgctgag tttgattgcc 43260 
gttgcgattg agtcggtgaa ctgttcgtgg 4332 0 
atgcccgccc cggcagcggc gaggttgcgc 43380 
agcacgagtt gcgggatgcc gagtcgggtc 4344 0 
cagatgacca gctcgcaggt acgcaggaac 43500 
gcgttgtcgg gtaggtcggt gagaagtgcc 43560 
tcgacgccgg gcaactcggt ggcagccgct 4362 0 
gcgttcagca ccatgcggcc catgcagatg 43680 
ccccatgccg ggaatgcgcc gcttccgttg 43740 
ggcgcgtcgc ttgcttgcag gctcggcgga 43 800 
aggccggtca gtccgtggtg ccggcacacc 43 860 
aaggggcctg cggtggggtc gactccccag 43 920 
ccgccgagca cccggccgat cagcgcgcag 43980 
gcctcggcca gtcgaaggta ttcggggagc 4404 0 
gcggtctgct cccacagttg ccggcctgcc 44100 
ggaaagcgca gctgcgtggt tccacccgta 44160 
gccgtggtga gacctgcacc atgcgcggtc 44220 
acctcgtgcc cggatgcttg cagcgcccag 44280 
ggatagggca agggaacgac gagtacgcgc 44340 
ttagcgcagc agcccctact cccattggcc 44400 
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aggatttgga aaatgcgctg cgtatgtcga tcgccgttga cgtccaacgg acttccggcg 44460 
gcaacaatag tgtgtcacgg caggaatgtc acgcgaccat cgaagatctt tgggtcgccg 44520 
cacctggttt cacgcgaacg agtgaaatgc gcgagctccg ctcgatcggg gtgggccgga 44580 
cctgtacggt gatcaccgtt ggttctgcgg ggattcatgg ggaagatttg cgctggctgt 44640 
ttgcctcctg gccggatagt tatagtcggt accgccgcat gcggcggtaa ccgcgaatta 44700 
actgacggct agtttgccgt cttttctctc tgtgtgtttc ctgctcggtt ccagaaaatt 44760 
acgagaaggt gaacgttgca gagatcaggc ataccggtgt tgccaggtgg cgcaccaaca 44820 
tcgcagcagg ttgggcagat gtatgacctg gtcacgccgt tgctgaactc ggtcgcgggc 44880. 
ggcccctgcg ccatccacca cggctactgg gagaacgacg ggcgggcttc ctggcagcag 44940 
gccgccgacc ggctcaccga ccttgtcgcc gaacggaccg tgctcgatgg cggcgttcga 45000 
ctgctcgatg tggggtgcgg taccggacaa ccagcgctgc gcgtcgcgcg cgacaacgcg 45060 
atccagatca ccggcatcac cgtcagccag gtgcaagtgg ccatcgccgc tgattgcgca 45120 
cgcgaacgcg gactaagcca ccgggtggac ttctcgtgcg tcgatgccat gtccctgccg 45180 
tacccggaca atgctttcga cgccgcctgg gccatgcagt cgctgttgga gatgtccgaa 4524 0 
ccggaccgtg ccatccggga aatccttcga gtactcaaac ccggtggcat cctcggcgtc 45300 
accgaggtcg tcaaacgaga agcgggcggc gggatgccgg tgtccgggga caggtggccg 45360 
accggccttc ggatctgcct ggctgagcaa cttctggaat cgctgcgtgc agcggggttc 45420 
gagatcctcg attgggagga cgtgtcgtcg aggacccggt acttcatgcc gcagttcgcc 454 80 
gaagagctcg ctgcgcacca gcacgggatc gcggacaggt acgggccggc tgtcgccggc 45540 
tgggccgccg cggtctgcga ttatgagaaa tatgcccacg acatgggcta tgcgattctg 45600 
acggcgcgga agccggtcgg ctgagggcgc gccgcaattc gatgacgttc atgcgccgtg 45660 
tcggggaatc gccggtggcg gcgccagcag aggctgaact tactggtggt gtgtccagga 45720 
atcggagggg cagtaccgaa tgagcgaagc cgggaacctg atagccgtca tcggactgtc 45780 
ctgccgccta ccccaggcgc ctgacccggc ttccttctgg cggttgctgc gcaccggaac 45840 
ggacgccatc accacggtcc cggaagggcg gtggggcgac ccgttgcctg gtcgggatgc 45900 
gcccaagggc ccggaatggg gtggtttcct ggctgatgtc gactgcttcg atcccgagtt 45960 
cttcgggatc tcgccgcgag aagcggcagc cgtggatccc cagcagaggc tggctctgga 46020 
gctcgcctgg gaggcactcg aagacgccgg tatccccgcc ggcgagctgc gcggtactgc 46080 
cgccggtgtg ttcatggggg cgatctctga cgactacgcc gccctgctgc gcgagagccc 4614 0 
gccggaagtg gctgcgcagt accgcctcac cggcacccat cgaagcctga tcgccaaccg 46200 
cgtgtcctat gtgctcggcc tgcgcgggcc aagcctgacg gtggattcag gtcagtcctc 46260 
gtccctggtc ggcgtgcatc tcgccagcga gagcctgcga cggggtgagt gcacgatcgc 4 6320 
actcgccggc ggcgtgaacc tcaacctggc cgccgagagc aacagcgctc tgatggactt 46380 
cggcgcgctc tccccggacg gtcgctgctt caccttcgat gtgcgggcga acggttacgt 46440 
ccgtggtgag ggcggcggcc ttgtcgtgct gaagaaggcc gatcaggcgc acgccgatgg 46500 
cgaccggatc tactgcctca tccgcggcag cgcggtcaac aacgatgggg gcggtgccgg 46560 
gctcaccgtt ccggcggcgg acgcccaggc ggagctgctg cgccaggcat accggaacgc 46620 
gggcgtcgac ccggccgccg tgcagtatgt cgagctccac ggcagcgcga ccagggtcgg 46680 
ggatcccgtc gaagcagcag ccctcggagc tgtcctgggg gcggcgagac ggcccggcga 46740 
cgagctgcgt gtggggtcgg cgaagaccaa cgtcggccat ctggaagcag cggcgggcgt 4 6 800 
caccgggttg ctgaagaccg cactcagcat ctggcaccgc gaactgccgc cgagtcttca 46860 
tttcaccgcc cccaacccgg aaatcccgct ggacgaattg aacctacgcg tccagcgtga 46920 
tctgcggccg tggccggaga gcgaggggcc gctgctggcc ggcgtcagcg ccttcggaat 46980 
gggaggcacg aactgccacc tggtgctctc cggcacgtcc cgggtggagc gacggcgcag 47040 
tggacccgct gaggcgacca tgccgtgggt cttgtcggcc agaacaccgg tcgcattgcg 47100 
tgcgcaggcg gcgcgcttgc acacgcacct caatacggcc ggtcaaagtc cgttggacgt 47160 
cgcctactca ctggcgacca ctcgatccgc gctgccgcac cgggccgcgc tggtcgcgga 4722 0 
cgacgaaccg aaactgctcg ccgggttgaa ggccctcgct gacggcgacg acgcgcccac 47280 
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gctgtgccac ggcgcgactt ccggcgagcg 
cagccagtgg atcgggatgg gtaggcagct 
gatgtcggac tgcgccgacg cattggcgcc 
gcgcaacgcg gccggcgctg cgcaccttga 
cgccatcatg gtctcgctcg cggagctctg 
cgtcgggcac tcgcaggggg agatcgcggc 
cgatgccgcc agggtggtgg cggtgcgcag 
cgcgatggcc tcgttgcagc atcccgccga 
cgatcggatc ggcgtggcgg gggtgaacgg 
ggaggcgatg gcggaactgc tggccgagtg 
tcccgttgaa tacgcctccc attcgcctca 
gctgttggcg ccggtcgaac ccaggacggg 
ggacctgctg gaccggccga tggacgccga 
gctgttcgaa gcggccgtcg aggccctgtt 
cagcccacac ccggtgctga ctgcgaacat 
ggtagtggcg ctcgggacac tccgccgcgg 
gctggccaga gcacacgtac acggagtggc 
cggggcgcag cgggtcgacc tgccgacgta 
cgcgaagctt cccgacgtcg ccatgcccga 
gctgcggtct tcgccgaggg cggacgtgga 
ggcagccgtg gtcctcggcc actccgatcc 
ggacctgggc ttcgattcct cgatggtggt 
aggtctgcga ctcgcaccga gcgtcgtttt 
ccaggtacgg acgttgttgt tgggcgagcc 
cgatgcggac gagcctatcg ccgtgatcgg 
ctcgcccgag gagttgtggc agttggtcgc 
ggctgaccga ggttgggacc tggagcgtgc 
cttgcatggc gccccggatt ttgaccccgg 
ggcgatggat ccacagcagc ggttgctgct 
cgggatcaac ccgcagcacc tgcacggaag 
cctggactac gggccacgcc tgcacgaagc 
cggcagcacc acgagtgtgg cgtcgggtcg 
tgcggtgacg gtggatacgg cgtgttcgtc 
gtcgttgcgt tcgggtgagt gtgatctggc 
gccggggatg ttcgtggagt tttcgcggca 
gtcgttcgcg gaggccgccg acggcaccgg 
ggagcggttg tcggatgccc ggcggaatgg 
tgcggtgaat caggacggtg cgtcgaatgg 
gcgggtgatt gcccaggcat tggcgagtgc 
ggaggcgcat gggacgggca cgcggcttgg 
cacctacggc cagggccggc ttccggaacg 
catcggtcac gcgcaggcag ctgcggggat 
gcggcacggg cagctaccgc gcacgttgca 
gtcggcgggg acggttcaac tccttacgga 
tcgtcgggtg ggggtgtcgt cgttcgggat 
acagcccccg ggagtgccga 



ggcagcggtc ttcgtctttc ccggacaggg 47340 
gctcgaaacc tccgaggttt tcgcggcgtc 47400 
gcacctggat tggtccctgc tggatgtgct 47460 
ccacgacgat gtcgtccagc ccgcgctgtt 47520 
gcgttcgtgg ggcgtgcgtc cggtggcggt 47580 
ggcctgcgtc gccggggccc tgtccgtccg 47640 
caggcttctg acggcgctgg ccggcagtgg 47700 
agaggtgcgg caaatcctgt tgccctggcg 47760 
accgtcgtcg accctggtgt caggggaccg 47 820 
cgcagaccga gagctccgga tgcgccggat 47 880 
catcgaggtt gtccgggatg agctgctggg 47940 
aagcatcccg atctattcga cgacgaccgg 48000 
ctactggtac cgcaaccttc gtcaaccggt 48060 
gaagcggggg tacgacgcat tcatcgagat 4 8120 
ccaggaaacc gccgtgcgag cagggcggga 48180 
cgaaggtggc atgcggcagg cgctgacgtc 48240 
cgcggactgg. cacgcggtct tcgccggtac 48300 
cgcctttcag cgacagcgct actggctgga 48360 
gagcgacgtg tcgacggcgt tgcgggaaaa 48420 
ctcgacgacc ctcacgatga tccgggcaca 48480 
gaaagaggtg gacccggatc ggacgttcaa 48540 
cgagctgtgc gaccgcctaa acgccgccac 48600 
cgactgtcct acgccggaca agctcgcccg 48660 
ggctcccatg acgtcacacc ggccggactc 48720 
gatgggctgt cggtttccgg gtggggtgtc 48780 
cgctgggcgg gacgtcgtgt ccgagttccc 48840 
ggggacatcg cacgtgcgcg ccggcgggtt 48900 
gttcttccgg atttcgccgc gcgaggcgtt 48960 
ggaaatcgcc tgggaagcag tcgaacgagg 49020 
tcaaaccggg gtcttcgtcg gcgcgacctc 49080 
gtccgaggag gcggccgggt acgtgctcac 49140 
ggttgcgtat tcgttcgggt tcgagggccc 4 92 00 
gtcgttggtg gccctgcatt tggcgtgtca 49260 
gttggccggt ggtgtgaccg tgatggccac 49320 
gcgtggtttg gcgccggatg ggcggtgcaa 4 93 80 
ctggtccgag ggtgctggcc tggttctact 4 9440 
gcatgaggtg ctggcggttg ttcgtggtag 49500 
tttgaccgcg ccgaatggtt cgtcgcagca 4 9560 
ggggttgtcg gtgtccgatg tggatgctgt 49620 
tgatccgatc gaggcgcagg cgctgatcgc 4 9680 
gccattgtgg ttgggctcga tgaagtcgaa 4 9740 
agccggcgtc atgaagatgg tgatggcgat 4 9800 
cgtggatgag ccgacttctg gggtggattg 4 9860 
gaacacgccc tggcccggga gtggtcgtgt 49920 
cagtggtact aacgcgcacg tcatcctcga 49980 

50000 
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<210> 2 
<211> 50000 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 2 

gtcagtctgc ggggccgggt tcgggctctg 
tgtcgggcaa aacacccgaa gcgctatccg 
acgagcgacc tgatgtctcc tcgctggatg 
cgctggatga gcgagcggtg gtgctggggt 
aagcgctgtc tgccggtcat gaggcttctg 
gccgcatcgg gtttgtgttt tccggtcagg 
tttaccgggc ttttccggtg ttcgctgctg 
cgcatctggg ccaggaaatc ggggttcggg 
tggatcggac gttgtgggcg cagtcgggtt 
tgctggattc gtggggggtt cggccgagtg 
cggcggcgtt cgcggcgggt gtggtgtcgt 
gtgcccggtt gatgcaggcg ttgccgtctg 
aggagctgtt gtggtcgttg ttggccgatc 
acgctgcggg gtcggtggtg ctctctggtg 
ggctggacgg gcaagggatc cggtcgaggt 
atcggatgga tccgatgctg gcggagttcg 
gttgtgaagt gccgatcgtg tcgaccttga 
gcgggcccga ctactgggtg cgtcaggtgc 
aggcgctggt cgagcacgat gtggccaccg 
cggcgctgat ccaggaatgt gtcgccgcat 
cggcgatgcg caggaaccag gacgaggcgc 
acgtacgtgg tggtgcggtg gactggcggt 
tcgagctgcc cacctacgcc ttccaacgac 
cttccgccgg cgacatgggc aggcgtgtcg 
aagatgtgga atcgcttgca cgcgtattgg 
ccctgagaag cgcccttccg gtgttggccg 
ttatggatca gcggtgttac cgaattggct 
gaactgtttt cggtacctgg ctggtcttcg 
tggtggactg cgttacggca ctgcgggcac 
atcccgaccc gacctccttc ggcgaccggg 
ttgttggcgt gttgtcaatg ttgtgcttgg 
tgtcacgggg ttttgcgttg accgtggagt 
ctgcccggtt gtggttgctg acgtgtggtg 
cagcgcaggc cctggcgtgg gggttggggc 
ggggcggctt gatcgatatt ccggtcttgt 
ttgtgctggc aggtctcgat gaggacgagg 
gtcggttggt acgccacact gtctcagctg 
cggtgctggt gacgggcggc acgggtggtt 
acgccggagc cgaacatgtg gcgatggtga 
agaagttgcg gacggaactg gaggatctgg 
tgaccgatcg cgaggcgctc gccgaagtgc 
ccgcggtagt gcatgcggca ggcgtgatcg 



tcgtggatgt tccggtggtg ccgtggatgg 60 
cgcaggcaac ggcgttgatg acctatctgg 120 
ttgggtactc gctggcgttg acacggtcgg 180 
cggaccgtga aacgttgttg tgcggtgtga 240 
ggttggtgac cggatctgtg ggggctgggg 300 
gtggtcagtg gctggggatg ggccgggggc 360 
cctttgacga agcttgtgcc gagctggatg 420 
aggtggtgtc cggttcggat gcgcagttgc 4 80 
tgttcgcgtt gcaggtgggc ttgctgaagt 540 
tggtgttggg gcattcggtg ggcgagttgg 600 
tgtcgggtgc ggctcggttg gtggcgggtc 660 
gcggtgggat gctggcggtg cctgctggtg 720 
agggtgatcg tgtggggatc gccgcggtca 780 
atcgggatgt gctcgatgac cttgccggtc 84 0 
ggttgcgggt gtcgcatgcg tttcattcgt 900 
ccgaattggc acgaaccgtg gattaccggc 960 
ccggagacct cgatgacgct ggcaggatga 1020 
gagagccggt ccgcttcgcc gacggtgtcc 1080 
ttgtcgagct cggtccggac ggggcgttgt 114 0 
ccgatcacgc cgggcggctg agcgcggtcc 1200 
agaaggtgat gacggccctg gcacacgtcc 1260 
cgttcttcgc cggtacaagg gcgaagcaaa 132 0 
agcggtactg gctgaacgcg ctgcgtgaat 13 80 
aagcgaagtt ctggggcgcc gtcgagcacg 1440 
gcattgtgga cgacggcgct gctgtggatt 1500 
gttggcagcg aacccgcacc accgagtcca 1560 
ggcggcaggt agccggactc ccgccgatgg 1620 
cgcctcatgg ctggtccagc gaaccggagg 1680 
gtggtgcctc ggtggtgttg gtggaagctg 1740 
tacgaaccct gtgttcgggc cttccggatc 1800 
aagaatcggt ccttccggga ttttctgcgg 1860 
tggtgcgggt tttgcgggca gctggtgcga- 1920 
gcgtgtcggt gggagatgta ccggttcgtc 1980 
gtgttgtggg gttggagcat ccggactggt 204 0 
tcgacgaaga cgctcaagag cggttgtcga 2100 
tcgcgatccg tcctgacggc atgttcgcgc 2160 
atgtgaagaa ggcgtggcgc cccaggggat 2220 
tgggggcgca cgttgctcgc tggctggccg 22 80 
gtcgacgcgg cgagcaggca ccgagtgctg 2 34 0 
gtacccgggt gtcgatcgtg tcatgcgatg 2400 
tgaaagccct tccggctgaa aacccgttga 2460 
agactggtga tgcggcggca atgagcctgg 2520 
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ctgatttcga tcacgtgttg tccgcaaagg 
tggccgatgt ggaattggac gcgttcgtct 
ctgggggaca cggggcttac gcagcggcga 
gtcggtcgcg agggctggtc gcgactgcgg 
tggcctccgg agaaacagga gaccagctgc 
agcacgccat cgccggaatc cggcaggccg 
ccgatgtcga ttgggcacgt ttcagcgcgg 
tgaacgaact ggccgaggtc aaggaactcc 
ttgccgacgc gtcgttggaa tggcggcagc 
aacagctgat cctggagctg gtacgcggcg 
cagcggccgt tgcatcggaa cgagccttca 
tcgaactccg cgttcggctc aatcgagcta 
tcagccatcc cacgcctgcg gaactggctg 
ccgcaggagc aggcattccc gaggaggacg 
tcccgttccc ggccttgcgc gaggcaggct 
acccggtcga ctccggtatc tcctcggacg 
atgtagccgg cctcgtcgaa gcagcgctgg 
ggagatgacg gtgaccacca gttacgaaga 
ggagaacgaa cgcctccggc gcggcaggga 
cgcgatcgtg gcgatgagtt gtcgttatcc 
gcaactggct gccggcggtg tggacgcgat 
cctggatggc gtgttcgttc cggactccga 
cggttttctt cagggcgtgt. cggagttcga 
ggcgctggcg atggatccgc agcagcggtt 
gcgggctggg ctggagcagc ggtcgacacg 
caatggccag gactacgcgt cgtggttgcg 
gctgacgggc ggtgcggcag cggttctttc 
gggtcctgcg gtgacggtgg atacggcgtg 
ggggcaagca ctgcgggccg gtgagtgcga 
gtcgacgccg aaggtgttcc tggagttctc 
gtgcaagtcg ttcgcggcgg gtgcggatgg 
gttgctggag cggttgtcgg atgcccggcg 
tggtagtgcg gtgaatcagg acggtgcgtc 
gcagcagcgg gtgattaccc aggcgttggc 
tgctgtggag gcgcatggga cgggcacgcg 
gatcgccacc tacggccgtg atcgtgatcc 
gtcgaacatc ggtcatacgc aagcggcggc 
ggcgatgcgg cacgggcagc tgccacgcac 
ggattggtcg gcggggacgg ttcaactcct 
tcgtgttcgt cgggtggggg tgtcgtcgtt 
cctcgaacag cccccgggag tgccgagtca 
ggatgttccg gtggtgccgt ggatggtgtc 
ggcaacggcg ttgatgacct atctggacga 
gtactcgctg gcgttgacac ggtcggcgct 
ccgtgaaacg ttgttgtgcg gtgtgaaagc 
ggtgaccgga tctgtggggg ctgggggccg 
tcagtggctg gggatgggcc gggggcttta 
tgacgaagct tgtgccgagc tggatgcaca 
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tggccggtgc cgcgaatctg gatgccttgt 2580 
tgttctcatc ggtgtcagga gtttggggcg 2640 
atgcctatct ggatgcgctc gcggaacagc 2700 
tggcctgggg gccgtgggcc ggcgagggcn 2760 
gccgatacgg cctttcccca atggctccge 2820 
tggaacagga cgaaatttcc ctggtagtgg 2880 
gattgctggc ggctaggccg cggccgctgc 2940 
tcgtcgatgc ccagcccgag gcgggagtcc 3000 
gattgtccgc ggcaccgagg ccgacacagg 3060 
aaaccgctct ggtgctggga caccccgggg 3120 
aggacagcgg attcgactcg caggccgcgg 3180 
ccggcctcca gttgccatcg acaattatct 3240 
cggagctgcg ggcgaggctt cttcccgagt 33 00 
aggcgcgaat cagagcggca ctgacgtcga 3360 
tggtgagtcc gctgctcgca cttgccggac 3420 
atgcggccgc gacctcgatc gatgcgatgg 34 80 
gcgaacgcga gtcctgagac cgccgacctg 3540 
agttgtcgag gcactgcgag catcgctcaa 3600 
tcggttctcc gcggagaagg acgatcccat 3660 
cggtcaggtc tcctcgccgg aggacctgtg 3720 
ctccgaagtt ccgggggatc gcggatggga 3780 
tcgtcctggc acgtcgtatg cctgcgcggg 3 84 0 
cgcgggtttc ttcgggattt cgccgcgtga 3 900 
gctgctggaa gtcgcgtggg aggtcttcga 3960 
cggttcccgc gttggcgtgt tcgtcggcac 4020 
gacgccgccg cctgcggtgg caggtcatgt 4080 
gggccgggtt gcgtattcgt tcgggttcga 4140 
ttcgtcgtcg ttggtggcgt tgcacctggc 4200 
ccttgccctt gccggtggcg tcacggtgat 4260 
ccgccaacgg ggtctcgcgc cggatgggcg 432 0 
cactggatgg ggtgagggtg ccggactgtt 4380 
gaatgggcat gaggtgctgg cggttgttcg 4440 
gaatggtttg accgcgccga atggttcgtc 4500 
gagtgcgggg ttgtcggtgt ccgatgtgga 4560 
gcttggtgat ccgatcgagg cgcaggcgct 4620 
tggccggccg ttgtggttgg ggtcggtcaa 4680 
gggtgtggct ggtgtgatca agatggtgat 4740 
gttgcacgtg gaatcgccgt cgccggaggt 4800 
tacggagaac acgccctggc ccaggagtgg 4860 
cgggatcagt ggtactaacg cgcacgtcat 4920 
gtctgcgggg ccgggttcgg gttctgtcgt 4980 
gggcaaaaca cccgaagcgc tatccgcgca 5040 
gcgacctgat gtctcctcgc tggatgttgg 5100 
ggatgagcga gcggtggtgc tggggtcgga 5160 
gctgtctgcc ggtcatgagg cttctgggtt 5220 
catcgggttt gtgttttccg gtcagggtgg 52 80 
ccgggctttt ccggtgttcg ctgctgcctt 5340 
tctgggccag gaaatcgggg ttcgggaggt 54 00 
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ggtgtccggt tcggatgcgc agttgctgga tcggacgttg tgggcgcagt cgggtttgtt 5460 
cgcgttgcag gtgggcttgc tgaagttgct ggattcgtgg ggggttcggc cgagtgtggt 5520 
gttggggcat tcggtgggcg agttggcggc ggcgttcgcg gcgggtgtgg tgtcgttgtc 5580 
gggtgcggct cggttggtgg cgggtcgtgc ccggttgatg caggcgttgc cgtctggcgg 5640 
tgggatgctg gcggtgcctg ctggtgagga gctgttgtgg tcgttgttgg ccgatcaggg 5700 
tgatcgtgtg gggatcgccg cggtcaacgc tgcggggtcg gtggtgctct ctggtgatcg 5760 
ggatgtgctc gatgaccttg ccggtcggct ggacgggcaa gggatccggt cgaggtggtt 5820 
gcgggtgtcg catgcgtttc attcgtatcg gatggatccg atgctggcgg agttcgccga 5880 
attggcacga accgtggatt accggcgttg tgaagtgccg atcgtgtcga ccttgaccgg 5940 
agacctcgat gacgctggca ggatgagcgg gcccgactac tgggtgcgtc aggtgcgaga 6000 
gccggtccgc ttcgccgacg gtgtccaggc gctggtcgag cacgatgtgg ccactgttgt 6060 
cgagctcggt ccggacgggg cgttgtcggc gctgatccag gaatgtgtcg ccgcatccga 6120 
tcacgccggg cggctgagcg cggtcccggc gatgcgcagg aaccaggacg aggcgcagaa 6180 
ggtgatgacg gccctggcac acgtccacgt acgtggtggt gcggtggact ggcggtcgtt 6240 
cttcgccggt acgggagcga aacaaatcga gctgcccacc tacgccttcc aacgacagcg 6300 
gtactggctg gtgccatcgg attccggtga tgtgacaggt gccggtctgg ccggggcgga 6360 
gcatccgctg ttgggtgctg tggtgccggt cgcgggtggt gacgaggtgt tgctgaccgg 6420 
caggatttcg gtgcggacgc atccgtggct ggccgaacac cgggtgctgg gtgaagtgat 64 80 
cgttgcgggc accgcgttgc tggagatcgc cttgcacgcg ggggaacgtc ttggttgtga 654 0 
acgggtggaa gagctcaccc tggaagcacc gctggtcctg ccggagcgcg gggcgatcca 6600 
ggttcagctg cgagtgggcg cgcccgagaa ttccggacgc aggccgatgg cgctgtattc 6660 
acgccccgaa ggggcggcgg agcatgactg gacgcggcac gccacgggcc ggttggcgcc 6720 
aggccgcggc gaggcggctg gagacctggc cgactggccg gctcctggcg cgctgccggt 6780 
cgacctcgac gaattctatc gggacctcgc agagcttggg ctggagtacg gcccgatctt 6840 
ccaagggctc aaggcggcct ggcggcaagg ggacgaggtg tacgccgaag ccgcgctgcc 6900 
gggaacggaa gattctggtt tcggggtgca tccggcactg ctggacgcgg ctctgcacgc 6960 
aacggctgtc cgagacatgg atgacgcacg cttgccgttc cagtgggaag gtgtgtccct 7020 
gcacgccaag gccgcgccgg ctttgcgggt ccgcgtggtc ccggctggtg acgatgccaa 7080 
gtccctgctg gtttgtgatg gcaccggtcg accggtgatc tcggtggacc gactcgtatt 7140 
gcggtcggct gcggcccggc ggaccggtgc gcgccgacag gcccatcaag ctcggttgta 72 00 
ccggttgagc tggccaacgg ttcaactgcc gacatccgct cagccaccgt cctgcgtgct 72 60 
tctcggcacc tcagaagtgt ccgctgacat acaggtgtat ccggacctcc ggtcgttgac 7320 
ggctgcgttg gatgccggtg ccgaaccacc cggcgtcgtc atcgcaccca cgccccccgg 73 80 
cggtggacga acagcggatg tccgggagac gactcggcat gcactcgacc tggtacaagg 7440 
ctggctttcc gatcagcgac tcaacgaatc ccgattgctc ctggtgacac agggagcagt 7500 
ggccgtggag ccgggcgaac ccgtgaccga tctggcgcag gccgcgctct ggggactgct 7560 
gcggtcgacg cagaccgaac accctgatcg cttcgtcctc gtcgatgtgc ctgagcccgc 7620 
gcaactcctc cccgcgctgc cgggggtgct ggcctgcggc gaacctcagc tcgcgttgcg 7680 
acgtggcggc gctcatgcgc ccagactggc tggactgggc agcgatgacg tcctgcccgt 774 0 
gccggacggc accgggtggc gattggaggc cacgcgcccg ggaagcctgg atgggttggc 7800 
attggtggac gaaccgacgg ccacggcacc gctgggtgac ggtgaggtca ggattgcgat 7860 
gcgcgcggcc ggggtgaact tccgggatgc gctcatcgcg ctcggtatgt atcccggtgt 7920 
ggcatcgctg ggcagtgagg gcgccggggt cgtggtggag accggccccg gcgtcaccgg 7980 
cctggcaccc ggcgaccgcg tgatgggaat gatcccgaag gcgttcgggc cgctcgcggt 8040 
cgccgaccat cgcatggtga cgaggattcc cgctggttgg agcttcgcgc gggccgcatc 8100 
ggtgccgatc gtctttctca ccgcctacta cgcgctggtt gatctcgccg ggttgagacc 8160 
aggggagtcg ttgctggttc attcggccgc cggtggggtg gggatggccg cgatccaact 8220 
cgccaggcac ctcggtgcag aggtgtacgc caccgctagc gaggacaagt ggcaagccgt 8280 
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ggagctgagc cgagaacacc tcgcttcgtc 
cggggcaacc ggcggacgcg gcgtcgacgt 
cgatgcgtct ctgcgaatgc tgccgcgcgg 
tgttcgtgac cccgtcgagg tcgccgatgc 
taccgtagag gcaggcccgc agcgaatcgg 
cgagggacgc gtgctggagc ccctgcctgt 
ggcgctacgg cacctgagcc aagcgcggca 
ggtgtgggac gccgcaggca cggttctggt 
ggtcgcccgg cacctcgtga tcgagcgcgg 
cggtcccgca gccagtggcg ctgctgagct 
ggtttccttg caggcttgcg atgtcgccga 
catcccggac gagcatccgt tgaccgccgt 
agtgtccgaa tcgctcaccg tggagcggct 
cgcgcggaat ctgctcgagc tgatcgaccc 
gtcgggtgtg ctcggcagcg gtgggcaggg 
cgcattggcg cagcaaaggc agtcgcgcgg 
ctgggcggaa catggcatgg ccagcacctt 
atctgggttg ctgccgatct cgaccgagga 
cggcgcgcat accgtggtgg cgccggttcg 
caagttctcc gtcctgcaag gtttggtcgg 
ggatgatgcc gagagcctcc ggaaacggtt 
gattctgctg gacctcgtcc gcatgcatgt 
ggagatcacc gcggacggca cgttcaaggt 
gttgcgcaac cggatcaacg gggcgacggg 
ctacccgacg ccggatgcgc tcgccgcgca 
ggccgggaca ttcgaggaac tcgacaggtg 
tgaggccacg cgggcgcaga tcaccacccg 
cgtgtccggc ggaaccggcg gcggctccgt 
cgagcttttc caactcctcg acaacgatct 
tcgcaacttc ctggagggaa acgccacatg 
cggcgtgcgc tcgtggatct gcaccaggcg 
gagcgggaac ccatcgcgat cgtggcgatg 
ccggaagggc tgtggaaact ggtcgcctcc 
gatcgtggtt ggcacctcga cgagctctac 
tacacccggc acggcggctt cctccacgac 
atcagccccc gtgaggcgct cgcgatggac 
tgggagaccg tcgaatccgc tgggatggac 
gtgttcgcgg gattgatgta cgagggctat 
gtcgaaggct atctcggaac cggcaatgcg 
gcgttcgggt tcgagggccc agcggtgacg 
gcgctgcatt tggcgtgtca gtcgttgcgg 
ggagtgacgg tgatgtcgac gccggagagg 
gcaccggatg ggcggtgtaa gtcgttcgcg 
ggtgccggtt tggtgttgct ggagcggctg 
ctggcggttg ttcgtggtag cgcggtgaat 
ccgaacgggc tggcccagga gcgggtcatt 
gcgtccgatg tggacgctgt ggaggcgcat 
gaggcgcagg ctctgatagc cgcctatgga 



gcggacgtgc gatttcgagc agcagttcct 834 0 
cgtgctcaac tccctcgccg gggagttcgc 84 00 
tggccgtttc ctggagttgg ggaagacgga 84 60 
gcatccgggc gtgtcttacc aggctttcga 852 0 
cgagatgctt cacgagctgg tggagttgtt 858 0 
cacggcttgg gacgttcggc aggcgcccga 8640 
tgtgggaaag ctggtgctca ccatgcctcc 8700 
taccggcgga acgggagcac ttggcgcaga 8760 
ggtgcgaaac ctggtcctcg tcagcaggcg 8820 
cgtggcgcaa ctgacggcct acggtgccga 8880 
tcgtgagacc ttggcgaagg tgcttgccag 8940 
ggtgcacgcg gctggtgttc tcgacgacgg . 9000 
ggaccaggtt ctgcgcccga aggtcgatgg 9060 
ggacgtggcc ctcgtgttgt tctcgtcggt 9120 
taactacgcg gcggccaact ccttcctcga 9180 
cctaccgacg agatcattgg cctgggggcc 9240 
gcgcgaagcc gagcaggatc gattggcgcg 9300 
ggggttgtcc cagttcgacg ccgcgtgcgg 9360 
attcagccgc ttgtccgacg ggaacgcgat 9420 
gccgcatcgc gtcaacaaag cggcgactgc 94 80 
gggacgcttg ccggatgcag aacaacatcg 954 0 
ggcggcagtg ctcggattcg ccggttctca 9600 
gctgggcttc gactcgttga ccgtggtcga 9660 
gctgcgactg cccgccaccc tggtgttcaa 9720 
cctcgtcacc gcgctgtccg cagaccgcct 9780 
ggcggcgaac ctgcccacgc tggccaggga 984 0 
gctacaggcg atcttgcaga gcctggcgga 9900 
gccggaccgg ctcagatcgg ccacggacga 9960 
cgaacttccc tgatgcctca gccggagcct 10020 
tcgaatgaag agaagctccg ggagtacttg 10080 
cgcgagcggc tgcacgaggc ggagtcggga 10140 
ggctgccggt acccgggtgg ggtgcaggac 10200 
ggtggcgacg ccatcggtga attccccgct 10260 
gatcccgacc cggatcagcc cggaacctgc 10320 
gccggcgagt tcgacgcggg attcttcgac 10380 
ccgcagcagc ggctgctgct ggaaatctcc 10440 
ccgaggtcct tgcgggggag ccgcaccggg 10500 
gacaccggcg cccaccgggc aggagaaggt 10560 
ggaagcgtcg cctctggtcg ggttgcgtat 10620 
gtagacacgg cgtgctcgtc gtcgttggtg 10680 
cagggcgagt gtgatctggc gctggccggt 10740 
ttcgtggagt tctcccgtca gcgtggtctc 10800 
gcggctgcgg atggaaccgg ttggggtgag 10860 
tcagacgcca ggcggaacgg gcatcgggta 10920 
caggacggtg cgtcgaacgg attgacggcc 10980 
cagcaggtgc tcacgagtgc ggggctgtcg 11040 
ggaacgggta cgcggcttgg tgatccgatc 11100 
caggatcggg accgggaccg gccgctgtgg 11160 
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ttggggtcgg tcaagtccaa catcggtcat acgcaggcgg ctgcgggcgt cgctggtgtg 11220 
atcaagatgg tcatggcgat gcggcacggg gagctgccgc gcacgttgca cgtggacgag 11280 
ccgaattcgc acgtggactg gtcggctggt gcggtccgac tcctgaccga gaacatccgc 11340 
tggccaggga cgggtacgcg ccgcgctgga gtgtcgtcgt tcggggtaag cggtaccaac 11400 
gcacacgtca tcctcgaaca cgacccgctc gccgtgaccg agaacgagga agcagcgcag 114 60 
tccccagcac ctgggatcgt gccctgggcg ttgtccgggc ggtcgtcgac ggcgctgcgg 11520 
gcccaggccg aacggctgcg cgagctgtgc gagcagaccg atcccgaccc cgtcgatgtc 11580 
ggtttctcac tggccgccac gcgcacggct tgggagcacc gagcggtggt gcttggtcgg 11640 
gacagcgcta cgttgcgctc cgggcttggc gttgttgcca gcggtgaacc agcggtcgat 11700 
gtcgttgagg ggagcgtcct ggacggcgag gtcgtcttcg tcttccccgg tcagggctgg 11760 
cagtgggccg gtatggcagt cgacctgctg gacgcttcgc cgacgttcgc gcgccacatg 11820 
gacgagtgcg ccaccgcgct gcggaggtac gtggactggt cgttggtcga cgtgctgcgc 11880 
ggagcggaga actccccacc gctggaccgg gtggacgtgc tccagcccgc gtccttcgcg 1194 0 
gtgatggtgt cgctcgccga ggtgtggcgt tcctacgggg tgaggccggc ggccgtcgtc 12000 
ggccacagtc aaggcgaaat cgccgcggcc tgcgcagccg gggtgctgcc gctggaggat 12060 
gcggccaggc ttgtcgcatt gcgcagcaga gcgttgaagg gactttcggg gcggggtggc 1212 0 
atggcgtcgc tggcctgccc tgcggatgag gtcgcggcat tgttcgcggg atcgggcggc 12180 
cgtctggaag ttgcggcgat caacggcccg cgatcggtcg tggtgtccgg cgatctggaa 1224 0 
gcggtggacg aactgctggc agagtgcgct gaaaaggaca tgcgtgcacg ccgtatcccc 12300 
gtcgactacg cctcgcattc agcgcacgtg gaggtggttc ggagcccggt gctggcggcc 12360 
gccgccgggg tgcgacaccg ggacggccag gtgccgtggt ggtcgacggt gatcggcgac 12420 
tgggtggatc cggccaggct ggacggcgag tattggtatc ggaacctccg gcagccggtc 12480 
cggttcgaac acgccgtgca gggcctggtc gagcggggat tcggcctgtt catcgaaatg 12540 
agtgcgcatc cggtgctgac cacggcggtc gaggaaaccg gtgcggagtc ggagaccgcc 12600 
gtggccgcgg taggtacctt gcgacgtgac tcgggcggcc tccggaggtt gttgcattcg 12660 
ctggccgagg cgtacgtgcg cggcgccacc gtggactggg ccgtggcgtt cgggggcgcg 12720 
ggccgacggc tggacctgcc gacctacccg ttccagcgcc agcggtactg gctggacaag 12780 
ggagctgcct ccgacgaggc tcgtgcggtc tcggacccgg cggcgggctg gttctggcaa 12840 
gccgtggcgc gccaagacct gaaaagcgtg tccgatgccc tcgatctcga cgccgacgca 12900 
ccgctgagcg caacacttcc agccctgtcc gtctggcacc gtcaggaacg agaaagggtc 12960 
ttggcagacg gttggcggta ccgagtcgac tgggtacggg tggccccgca gccggtccgg 1302 0 
agaacgcggg aaacctggct cctggtcgtt cccccgggcg gcatcgagga agcgctggtc 13080 
gaacggctga cggatgcgtt gaacacgcga gggatcagca ccctgcgcct cgacgtgcca 1314 0 
ccggcggcga ccagtggcga actcgcaacc gaactccgcg ccgcagccga cggtgacccg 13200 
gtgaaggcaa tcctgtcgct caccgcgttg gacgagcgac cccaccccga atgcaaggac 13260 
gtcccgagcg ggattgcctt gctgctgaac ctggtcaagg cgctcggtga agccgacctc 13320 
agaattcctc tgtggaccat cacgcgtggt gcggtcaagg caggccccgc agatcggctg 13380 
ctgcgcccga tgcaggcgca agcatggggt ctggggcgag tagccgcact cgaacacccc 13440 
gagcgctggg gtgggctgat cgacctgccg gattcgctgg acggcgacgt cctcacgagg 13500 
ctgggcgaag cgctcaccaa cggcttggcg gaagaccaac tggcgattcg ccagtcgggc 13560 
gtgctggccc ggcgactggt acccgccccg gcgaatcagc ccgctggacg taagtggcgc 13 620 
ccccgaggga gcgcgctgat cacgggcgga ctcggcgcgg tgggcgcaca ggtggcgagg 13680 
tggttggccg aaatcggagc cgagcgaatc gtgctcacca gtcgacgggg caaccaagca 13 74 0 
gcaggcgccg ccgagctgga agccgaactc cgggcccttg gagcgcaagt gtccatcgtg 13 800 
gcttgcgacg tgaccgatcg tgccgagatg tccgcactac tggccgagtt cgacgtcacc 13860 
gcggtgttcc acgcggccgg agtcggtcgg ctgctgccgt tggcggagac cgaccagaac 13920 
ggcctggccg aaatatgcgc ggcgaaggtc cgcggcgctc aggtgctgga cgaactgtgc 13980 
gacagcaccg atctcgatgc cttcgtcctg ttctcctcgg gtgccggggt atggggcggg 14040 
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ggcggtcagg gcgcttacgg cgcggcgaac gcattcttgg acacactcgc cgaacaacgc 14100 
cgagcacgcg gtctgccggc aacctcgatc tcctggggca gttgggccgg cggcggcatg 14160 
gccgacggcg cggcgggcga acacctgcgg cgacgcggga tacgtccgat gccggcggcg 14220 
tcggccatcc tggctctgca ggaagtactt gaccaggatg agacgtgcgt gtcgatcgct 142 80 
gatgtggact gggaccgatt cgttcccacg ttcgccgcga ctcgcgccac ccggttgttc 14340 
gacgaagtgc cggcggcgag aaaggcgatg cccgcgaatg ggccggcaga accaggcggc 144 00 
tcgccgttcg cccgcaatct cgcggagctg ccggaagccc aacgacgcca cgaactggtg 14460 
gatctggtgt gcgcccaggt ggcaaccgtg ctcgggcacg gcagtcgcga ggaagtccag 14520 
cccgagcggg cgttccgcgc gctcgggttc gactccctca tggcggtgga tctgcgcaat 14580 
cgtttgacca ccgccaccgg gttgcgcctg ccgaccacaa ccgtcttcga ctacccgaat 14640 
ccggccgcct tggccgctca cctgctcgag gagctggtgg gtgatgtcgc gtcggctgcg 14700 
gtgaccgctg ccagcgcgcc cgcgagtgac gaaccgatcg cgatcgtcgc gatgagctgc 14760 
cggtttccgg gtggcgcgca ctcgccggaa gacctgtggc ggctggtcgc cgccggcacg 14820 
gaggtgatcg gcgagttccc ctccgaccgg ggctgggatg cggaaggcct ttacgatccg 14880 
gatgcttcca ggcctggaac gacgtatgcg cggatggcgg gattcctcta cgacgccggt 14940 
gagttcgatg ccgacctgtt cggcatcagc ccacgtgagg cgttggcgat ggatccgcag 15000 
cagcggttgg tgctcgaaat cgcctgggaa gccctcgaac gggccggaat cgatccgttg 15060 
tccttgaagg gcagtggggt cggcacgtac atcggcgctg gaagccgtgg gtacgcgacg 15120 
gatgtgcggc agtttcccga ggaggcggag ggctacctgc tgacgggtac ctcggccagt 15180 
gtgctgtcgg gtcgggtcgc gtattcgttt ggtttcgagg gtcctgcggt gacggtggat 15240 
acggcttgtt cgtcgtcgtt ggtggcgttg catctggcgt gccagtcgtt gcgttcgggc 15300 
gagtgtgatc tggcgttggc cggtggtgtg accgtgatgt cgacgccgga gatgttcgtg 15360 
gagttctccc gtcagcgcgg tttggcgccg gatgggcggt gcaagtcgtt cgcggagagc 15420 
gcggacggca ccggctgggg cgaaggcgcg ggcctgttgt tgctggagcg gttgtcggac 154 80 
gcccaccgga atgggcatcg ggtgttggcg gtggttcgtg ggtcagcggt gaatcaggac 15540 
ggcgcctcga acggactggc ggcgccgaac ggtccgtcgc agcagcgggt gatcaaccag 15600 
gcactcgcga atgcggctct ttcggcgtcc gatgtggatg cggtggaggc acatggcacc 15660 
gggaccaggc tgggtgatcc gatcgaggcg caggcattga tcgcaacgta tgggcaggcc 15720 
cgggagcggg atcggccctt gtggctgggg tcggtcaagt cgaacatcgg tcatacgcag 15780 
gccgcggcgg gtgttgccgg tgtgatcaag atggtgatgg ccatgcggca cgggcagctg 15840 
cccgcctcgc tgcacgcgga tgagcccacg tcggaggtcg attggtcgtc gggggcggtc 15900 
cggctcctcg ccgaacaggt accttggccg gagtctgacc gtgttcgtcg ggtgggggtt 15960 
tcgtcgttcg ggatcagcgg caccaacgca catgtgatcc tcgaacaagc tacgaatgcg 16020 
ccagatagta cagcggagac ggacaaaaca gaatccggat ctactgtcga tattccggtc 16080 
gttccctggt tggtgtcggg aaagacgacg gattccctgc ggggacaagc cgaacgagtc 16140 
ttgtctcagg tcgagtcccg gccggagcag cgttcgctgg atgttgccta ctcgcttgct 16200 
tctggccgag ccgcgctgga tgaacgcgct gtcgtgctgg gtgcggaccg cggtgagctg 16260 
gttgctggac tggcggcgtt ggccgccggt caggaggctt ctggggtgat cagcggaact 16320 
cgtgcttctg ctcggttcgg gttcgtgttc tcggggcagg gtggtcagtg gttggggatg 16380 
ggcagagcgc tctactcgaa gtttccggtg ttcgctgctg cgtttgatga ggcttgcgcc 16440 
gagttggagg cacatctggg ggaagaccgc cgggttcggg atgtggtctt cggttccgat 16500 
gcgcagctgc tggatcagac gctgtgggcg cagtcgggtc tgttcgcgct gcaagccggc 16560 
ctcttggggc tgctgggttc gtggggcgtt cggccggatg tggtgatggg gcattcggtc 16620 
ggggagttgg ccgccgcgtt tgcggctggc gtgttgtcgt tgcgggatgc ggctcggttg 16680 
gtggccgcgc gcgcccggtt gatgcaagcc ctgccctctg acggcgcgat gttggcggtg 16740 
gctgctggtg aagaccttgt tcggccattg ctggccggtc gggaggagtc cgtgagcgtc 16800 
gccgcgctca atgcccccgg ttcggtggtg ttgtcgggcg atcgggaggt gctggccagc 16860 
atcgtcggcc ggctgaccga gctccgagtc cggacgcggc gcttgcgggt ctcccatgct 16920 
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tttcattcgc accggatgga cccgatgttg ggcgagttcg cccagatcgc cgagtctgcg 16980 
gagttcggta agccaacgac accgcttgtg tcgacgttga cgggtgagct cgacagagcc 17040 
gcggaaatga gcacaccagg gtattgggtg cgccaggcgc gtgaacccgt ccgtttcgcc 17100 
gacggtgtcc aggccctggc agcgcagggc ataggcacgg tcgtcgagct cggcccggac 17160 
ggaacgctgg cggcactggt tcgggagtgt gcgaccgagt ccgatcgggt tgggcggatt 17220 
tcgtcgatcc cactgatgcg cagggagcgg gacgagaccc gttcggtgat gacagccctg 17280 
gcgcatctcc acacccgtgg tggtgaggtg gactggcagg cgtttttcgc cggtaccggc 17340 
gctaggcagc tcgagttgcc aacgtatgcc ttccaacgac agcactactg gatcgagtcc 17400 
agtgcgcggc cagcacgcga ccgcgcagac atcggcgagg tggcggaaca gttctggacc 17460 
gcggttgacc aaggcgatct ggcaacgttg gtcgccgctc tggatcttgg ggcggacgac 17520 
gacacatgcg catcgttgag cgatgtattg .ccggcgttgt cctcctggcg aagcggactc 17580 
cgcaaccgtt cgctcgtcga ttcctgccgg taccgaatca gttggcattc ctctcgggag 17640 
gtgccggccc cgaagatttc cggtacctgg ctgttggtcg tgcccggtgc tgcggatgac 17700 
ggattggtca cggctttgac gagttcactg gtcggaggcg gcgccgaggt cgtccggatc 17760 
ggcctgtccg aagaggaccc gcaccgcgag gacgtcgcac agcggctggc caatgcgctg 17820 
acggatgccg gtcaactcgg tggcgtgctt tcgctgttgg ggctcgatga atcgcctgct 17880 
ccgggattct cctgcttgcc aactggtttc gcgctgactg tgcagcttct gcgggccttg 17940 
cggaaggccg acgtcgaggc gcctttttgg gcggtgacgc gcggcggcgt cgcgttggaa 18000 
gatgtacgcg tgtctccgga gcaggccctg gtctgggggc tgctgcgtgt cgcgggactg 18060 
gagcacccgg agttctgggg tggcttgatc gacctgccat cggactggga cgaccgattg 18120 
ggtgcccggt tggcgggtgt gttggcggat ggtggcgagg atcaagtcgc cattcgccgt 18180 
ggtggtgtgt tcgtgcggcg gttggaacgc gctggtgcgt cgggtgccgg gtcggtgtgg 18240 
cgtcctcggg ggacggtgtt ggtgacgggt ggtacgggcg gtttgggggc gcatgttgcc 18300 
cggtggttgg ccggtgccgg ggctgagcac gtggtgttga ccagccgtcg aggagcggac 18360 
gctccgggcg ctggggaatt gcgggcggag ctggaggcgc tgggtgctcg ggtgtcgatt 18420 
gtgccctgcg acgtggctga tcgtgacgca gtggctggag tgttggcagg gatcggtggg 184 80 
gagtgtccgc tgactgcggt ggtacacgcc gccggggtcg gcgaggcggg cgacgtagtg 18540 
gagatgggtt tggcggattt tgcagcggtg ttgtcggcga aggtgcgtgg tgcggcgaat 18600 
ctggacgagt tgctggccga ctcggagctg gatgcgtttg tgatgttctc ctcggtgtcg 18660 
ggggtgtggg gagccggcgg acagggtgcg tatgcggctg cgaacgccta cttggatgcg 18720 
ttggccgagc agcgtcgggc gaggggattg gtcgggaccg cggttgcgtg gggaccgtgg 18780 
gccggtgacg gcatggccgc cggcgaaacc ggcgcacagc tgcaccggat gggcctggcg 1884 0 
tcgatggaac cgagcgcggc gctgctggca cttcagggtg cattggaccg cgatgagacc 18900 
tccctcgtcg tggccgatgt cgattgggca cggttcgccc cagccttcac ctcggcacgt 18960 
cgacgcccgc tgctggacac catcgacgag gcccgagccg cattggaaac caccggcgaa 19020 
caagcgggca caggcaaacc cgttgagctg acgcaacgcc tggccggact gtcgcggaag 19080 
gaacgcgacg atgcggtatt ggatctggtg cgggcggaga cggcggctgt gctgggacgc 19140 
gacgatgcca cggccctggc gccatcgcgg ccgttccagg aactcggatt cgactccttg 192 00 
atggcggtgg agctgcgcaa ccggctgaac accgccaccg ggatccagct gcccgccagc 19260 
acgattttcg actaccccaa tgccgagtcg ctgtcgcgtc acctctgcgc cgagcttttc 19320 
ccaacggaga ctaccgtgga ctcggccctt gccgagctcg atcgaatcga gcagcagctc 19380 
tcgatgctca ccggcgaagc gcgggcacgg gaccgaatcg cgacacgact gcgagccctc 1944 0 
cacgagaagt ggaacagcgc agctgaagta ccgaccggag ccgatgtcct gagcacgctc 19500 
gattcggcga cgcacgacga gatattcgag ttcatcgaca acgagctcga cctgtcctga 19560 
gcagttcctg cggaacttca agcgccgaaa tcgggtggaa atcacaatgg ccaatgaaga 19620 
aaagctcttc ggctatctga agaaggtaac tgcggacctg catcagaccc ggcagcgcct 19680 
gctcgcggcc gagagccgga gtcaggagcc gatcgcgatc gtctcggcga gctgccgact 1974 0 
gcccggcggc gtcgactctc ccgaagcgct ctggcaactc gtgcgcactg gcaccgacgc 19800 
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catctcggag ttccccgccg accggggctg ggatctcggc cggttgtacg atcccgaccc 19860 
gaaccaccag ggaacgtcgt acacgcgggc cggcggtttc ctcgcaggag cgggcgattt 19920 
cgaccccg.cc atgttcggga tttcgccgcg tgaggcgttg gcgatggacc cgcagcaacg 19980 
gttgttgctg gagctgtcct gggaggccct .cgaacgggcg ggcatagacc cgacatccct 20040 
gcgcggcagc aagaccggtg tcttcggtgg tgtcacgccc caggagtacg ggccgtcctt 20100 
gcaggagatg agccgaaacg ctgggggttt tggactcacc gggcggatgg tgagtgtggc 20160 
gtcgggtcgg gttgcgtatt cgtttggttt tgagggtcct gcggtgacgg tggatacggc 20220 
gtgttcgtcg tcgttggtgg ccctgcattt ggcgtgtcag tcgttgcgtt ccggcgaatg 20280 
cgatctcgcg ctggccggcg gtgtgacggt gatggcgaca ccggcgacgt tcgtggagtt 20340 
ctcccgtcag cgtggtttgg ctccggacgg gcggtgcaag tcgttcgcgg ctgccgcgga 20400 
tggcaccggg tggggtgagg gtgccggtct ggtgttgctg gagcggttgt cggatgcgcg 20460 
gcggaatggg cacgaggttc tggcggtggt gcggggtagc gcggtgaacc aggacggcgc 20520 
gtcgaatggt ttgactgcgc cgaatggtcc gtcgcagcag cgggtgatca cccaggcgtt 2 0580 
ggcgagtgcg gggctgtcgg tttccgatgt ggatgcggtc gaggcacatg ggaccgggac 20640 
cacgttgggt gatccgatcg aggcacaggc cctgatcgcc acgtacgggc agggccggga 20700 
gaaggatcgg ccgttgtggt tggggtcggt caagtccaac atcggtcaca cgcaggcggc 20760 
cgctggcgtt gccggcgtca tcaagatggt cttggcgatg cggcacgggc agctgcccgc 20820 
cacgttgcat gtggatgagc ccacgtcggc ggtggactgg tcggcgggtt cggtccggct 2 0880 
tctcacggag aacacgccct ggccggacag tggtcgtcct tgccgggtgg gggtgtcgtc 20940 
gttcgggatc agcggcacca acgcacatgt gattctcgaa cagtctccag tcgagcaggg 21000 
cgaaccggcc gggccggtcg aaggcgagcg ggaaccggat gtagccgtcc ccgtggtgcc 21060 
ttgggtgctg tcgggtaaga caccggaggc tgcgcgggcg caggccgaac gggtgcattc 21120 
gcatatcgag gaccggccgg ggctgtcgcc ggtggatgtg gcgtattcgc taggaatgac 21180 
acgcgcggcg ctggatgaac gcgcagtggt gttgggctcg gaccgtgccg cgctcctgac 2124 0 
cgggttgagg gcattcgccg acggctgcga tgcgcccgaa gtggtttcgg ggtctgtggg 213 00 
gcttggtggc cgcgtcgggt tcgtgttctc gggtcagggt ggtcagtggc cggggatggg 21360 
ccgggggctc tactcggtgt ttccggtgtt cgccgacgcg ttcgacgagg cttgcgcgga 21420 
gttggatgca cacctgggcc aggaactgcg ggttcgggat gtggtgttcg gttcgcaagc 214 80 
gtggttgctg gatcggacgg tgtgggcgca gtcgggtttg ttcgcgttgc agattggctt 21540 
gctgcggctg ctgggttcgt ggggtgttcg gccggatgtg gtgttggggc actcggtggg 21600 
tgagctggct gcggtgcatg cggctggtgt gttgtcgttg tcggaggccg cgcggttggt 21660 
ggcgggtcgc gcccggttga tgcaggcgtt gccttctggt ggtgccatgc tcgcggtcgc 21720 
tacgggtgag tttcaggtcg atcctctgct ggatggggtg cgggaccgga tcggtatcgc 21780 
ggcggtgaat ggcccggaat cggttgtgct ctctggtgac cgcgagctgc tcaccgagat 2184 0 
cgctgatcgg ttgcacgatc aggggtgccg gacccggtgg ttgcgggtgt cgcatgcttt 21900 
ccattcgccc catatggagc cgatgctgga ggagttcgcc cagatctccc gaggccgcga 21960 
atatcacgca ccggaactgc cgatcatctc gaccctgatc ggtgagctgg acggtggtcg 22020 
agtgatgggc actcccgagt actgggtgcg tcaggtgcgt gagcccgtcc gtttcgccga 22080 
gggtgtccag gcgcttgtcg gtcagggtgt cggcacgatt gtcgaattgg gtccggacgg 2214 0 
ggcgttgtcg acgttggtcg aggagtgtgt ggcggaatcc gggcgggtgg ccgggatccc 22200 
gctgatgcgc aaggaccgcg acgaggcgcg aaccgtgctg gcagctttgg cgcagatcca 22260 
cacccgtggt ggtgaggtgg actggcggtc gtttttcgcc ggtaccgggg cgaagcaagt 22320 
cgacctgccc acctacgcct tccagcggca gcggtactgg ctggcatcca ccgggcgtgc 223 80 
gggtgacgtg accgccgccg gattggccga ggcggaccat ccgctgctcg gtgcggtggt 22440 
tgcgttggca gacggcgaag gtgtggtgct gaccggtcgg ttgacagcgg gttcgcatcc 22500 
gtggttgtcc gatcaccggg tgctgggcga aatcgtcgtc cccggcaccg cgatcgtcga 22560 
gctggtgtgg cacgtcggcg agcgcctcgg ttgtggccgg gtggaagaac tggctttgga 22620 
agcgcccctg atcctgccgg atcatggagc ggtccaggtt caggtgctgg tgggaccgcc 22680 
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cggggaatcc ggagcccggt cggtggcgct ctactcctgt cctggcgagg cgatcgaacc 22740 
cgagtggaag aagcacgcga cgggcgtgct tctcccaccc gtggccgccg agaaccatga 22800 
gctgaccgca tggcccccgg agaatgcgac cgaaatcgat gcagacgggg tctacgcatt 22860 
ccttgaaggg cacggtttcg cgtacggacc ggcctttaga tgtctgcgcg gtgcctggcg 22920 
acgaggcggg gaggtgttcg ccgaagtcgc attgccggat gacatgcagg cgggggtcga 22 980 
tcgattcggc gtccaccccg cgttgctgga cgcggttctg catgccgccg cagccgagac 23040 
gtcggtggtc cagagcgaag cgcgggtgcc gttctcgtgg cgtggggtgg aacttcgcgc 23100 
cactgaaagc gcggtggtgc gggcgcgcct ctcgttgact tcggatgacg aactgtcgtt 23160 
ggtcgcagtg gacccggctg gccgattcgt ggccacggtt gattcgctgg tgacccgacc 23220 
gatctcccgg cagcaggtga ggtctggcgc gatcggtgat tgcctgttcg aggtggagtg 23280 
gcaccggaag gcgttgttgg gaacaaccgc cggcgacgac cttgccatcg tcggtgacgg 23340 
tcccagttgg ccggaatcgg tgcgcgcaac cgcacggttc gcgaccctgg atgagttccg 234 00 
tgcggccgtg gactcggacg ttcctgcccc gggttcggtg ttggtcgcag ctatgtcggc 23460 
cgaagaggtc gagggtggat ccctgccgtc gcgcgcccaa gagtcgacct ccgatctgct 23520 
ggctctcgtg cagtcgtggc ttgcggacga gcggttcgcc gaatcccagc tcgtggtcgt 23580 
cacgcgtgca gcggtgtcgg ccgactcgga ttcggacgtc gcggacctgg tgggtgcgtc 236.40 
gtcgtggggg ttgttgagtt cagcccagtc ggagaacccg ggtcgcttcg tgctggtgga 23700 
cgtggacggc acacctgagt cgtggcaggc gttgccggcc gccgtgcgag caggagaacc 23760 
gcagctggca cttcggcgcg gcgtggcgct ggtgcctcgg ttggcgcgac tcacggtgcg 23820 
cgaggagggc tcctccccgc aactcgacac ggacgggacc gtcctcatca cgggtggcac 23880 
cggtgcgttg gggggagtgg ttgcccgtca cctggtggag gagcacggga ttcggcgttt 23940 
ggtgttggca ggccggcgtg gctggaatgc gcctggagtc cacgagttgg tggatgagct 24 000 
ggcgcgcgcg ggcgccgtgg ttgaggtggt ggcttgcgat gtggctgacc gcaccgatct 24060 
ggagcacgtg ctggccgcca ttccggtcga ctggccgctg cgggggatcg tgcataccgc- 24120 
tggggtgctg gccgacggag tgatcgggtc cttgtcggcg gcggatgtgg gcacggtgtt 24180 
tgccccgaag gtgacggggg catggcatct gcacgagttg acccgcgatc tggatctgtc 24240 
gttcttcgtt cttttctctt ccttctccgg gattgcgggt gccgcagggc aggccaacta 24300 
cgcggcggcg aacacgttcc ' tggatgcatt ggcgcgttat cgccgggcgc gtgggctgcc 24360 
tgggttgtcg ttggcgtggg gactgtgggc gcaacccagc ggtatgacga gtggcttgga 24420 
cgcggcgtcg gtggagcggt tggcgcggac gggcatcgca gaactttcca cggaggatgg 24480 
actccgcctg ttcgatgccg cgttcgcgaa ggaccgggct tgcgtcgttg ccgctcgatt 24540 
ggacagggcg ctgctggtcg ggaacggacg atcgcacgcg attccggcgc tgttgagcgc 24600 
gttggttcct gttcgcggcg gtgtggcgag gaaaacagcc aattctcagg ccgcggatga 24660 
ggacgcactg ttgggtttgg tgcgggagca cgtttcggcc gtgctgggtt attcgggtgc 24720 
ggtcgaggtt gggggcgacc gtgctttccg tgatctgggt tttgattcgt tgtctggcgt 24 780 
ggagttgcgg aaccgccttg ccggggtgct gggggtgcgg ttgccggcga ctgcggtgtt 24840 
cgactatccg acgccgcggg cgctggcgcg tttcctgcat caggaactgg caggcgaggt 24900 
cgcgtccacg tcgacgccgg tgaccagggc agcgagtgcc gaagaggatc ttgttgcgat 24960 
tgtcgggatg ggatgtcgtt ttccgggtgg ggtgtcgtcg ccggaggagc tttggcggct 25020 
ggtggccggc ggcgtggatg cggtggctgg gttcccagac gatcgcggct gggatctcgc 2508 0 
ggcgttgtac gatcctgatc ccgatcgtct cgggacctcg tatgtgtgtg agggcgggtt 25140 
tctgcgggac gcggcggagt tcgatgctga catgttcggc atcagcccgc gtgaggcgtt 25200 
ggcgatggat ccgcagcagc ggttgctgct ggaggtcgcc tgggaaacct tggagcgggc 25260 
tgggatcgat ccgttctcgt tgcacggcag ccggaccggt gtgttcgcgg gcttgatgta 2532 0 
ccacgactat ggggcccgat tcattaccag agcaccggag ggcttcgaag ggcacctcgg 253 80 
gacgggcaat gcggggagcg tgctgtcggg tcgggttgcg tattcgtttg gtttcgaggg 25440 
tcctgcggtg acggtggata cggcgtgttc gtcgtcgttg gtggcgttac acctggcggg 25500 
tcaagcactg cgggccggtg agtgcgaatt cgcccttgcc ggtggcgtca cggtgatgtc 25560 
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gacgccgacg acgttcgtgg agttctcccg tcaacggggt ctggctccgg atgggcggtg 25620 
caagtcgttc gcggcggccg cggatggcac cgggtggggc gagggtgccg gtctggtgtt 25680 
gctggagcgg ttgtcggatg cccggcgcaa tgggcacgag gttctggcgg tggtgcgggg 2574 0 
tagcgcggtg aaccaggacg gcgcgtcgaa tggcttgact gcgccaaatg gtccgtcaca 25800 
gcaaagggtg atcacccagg cactcacgag tgccgggctg tccgtgtccg acgtggatgc .25860 
tgtggaggcg catgggacgg gcacgcggct tggtgatccg atcgaggcgc aggcgttgat 25920 
cgctacgtac ggccgggatc gtgatcccgg tcggccgttg tggctggggt cggtgaagtc 25980 
gaatattggt cacacccagg cggcggcggg tgtcgctggt gtgatcaaga tggtgatggc 26040 
gatgcggcag ggggagctgc cgcgcacgtt gcacgtggac gagccctccg cgcaggtgga 26100 
ctggtctgcg ggcacggtcc aactcctcac ggagaacacg ccctggcccg acagcggtcg 26160 
tcttcgccgg gcgggcgtgt catcgttcgg gatcagtggc accaacgcgc acctgatcct 26220 
tgaacaacct ccgcgagagt cgcagcgctc aacagagccg gattcgggtt ctgtccgcga 26280 
ttttccggtg gtgccgtgga tggtgtcggg caaaacaccc gaagcgctat ccgcccaggc 26340 
agatgcattg atgtcctact tgagcaatcg cgttgatgct tccccgcgag atatcggtta 26400 
ttcgcttgcg gtgacccgtc cggcgttgga ccaccgcgct gtcgtgctgg gtgcggatcg 26460 
tgccgcgttg ctgccgggct tgaaagcgct ggccgttagt aatgacgctg ccgaggtgat 26520 
caccggcact cgtgccgctg ggccggtcgg attcgtgttc tccggtcaag gtggtcagtg 26580 
gcccgggatg ggaagcgggc tccactcggc gtttccggtg ttcgccgacg cgtttgacga 2664 0 
agcctgctgc gagctggatg cgcatctcgg gcagatggcc cggctacgag atgtgttgtc 26700 
cggttcggat acgcaacttc tggaccagac cttgtgggcg cagccgggcc tgttcgcgtt 26760 
gcaagtcgga ctctgggagt tgttgggttc gtggggtgtc cggcccgctg tggtgctggg 26820 
ccactcggtc ggtgagctgg cggcggcgtt cgcggctgga gtgttgtcgt tgcgggatgc 26880 
ggctcggctg gtggcgggcc gtgcccggtt gatgcaagcc ctgccaactg gcggtgccat 26940 
gctcgctgcg gctgctggag aggagcagct gcgcccgttg ctggccgact gcggtgatcg 27000 
tgtggggatc gccgcggtca acgctcccgg gtcggtggtg ctctccggtg atcgggatgt 27060. 
gctcgatgac attgccggtc ggctggacgg gcaagggatc cggtccaggt ggttgcgggt 27120 
ttcgcatgcg tttcattcgc atcggatgga tccgatgctg gcggagttca ccgaaatcgc 27180 
ccggagcgtg gactaccggt cgtcagggct gccgatcgtg tcgacgttga cgggtgagct 27240 
cgatgaggtc ggcatgccgg ctacgccgga gtattgggtg cgccaggtgc gagaacccgt 27300 
ccgcttcgcc gacggtgttg ctgcgctcgc ggctcacggt gtgagcaccg tcgtcgaggt 27360 
cggtccggat ggggtgttgt cggcgctggt gcaggagtgc gcggccggat ccgatcaggg 2742 0 
cggacgggtg gccgcggttc cgctcatgcg cagcaatcgc gacgaggcgc acacggtgac 27480 
aacggcattg gcgcagatcc atgtgcgtgg tgctgaggtg gactggcggt cgtttttcgc 2754 0 
cggtaccggg gcaaagcagg tcgagctgcc cacgtatgcc ttccaacgac agcggtactg 27600 
gcttgactca ccatccgaac cggtcgggca atccgccgat cccgcgcgcc agtcgggctt 27660 
ctgggaactc gtcgagcagg aagatgtcag cgcgctcagc gccgctctgc acattaccgg 27720 
cgatcacgac gtgcaggcgt ccctggaatc ggtggttccg gtcctctcct cctggcatcg 27780 
ccggatccgc a'acgaatccc tggtgcacca gtggcggtac cggatttcct ggcatgagcg 27840 
ggcagatttg ccagacccct cgttgtcggg gacatggctc gtcgtcgtgc cggaggggtg 27900 
gtcggcgagt cggcaagttc tgcgtttcaa cgagatgttc gaggaacggg gttgcccggc 27 960 
agttctgttc gagctcgccg ggcacgacga ggaagccctg gcgcaacgat tccgctcgtt 28020 
gcctgttgcg tcagggggaa taagcggcgt gttgtccttg ctggcgctgg atgaatcgcc 28080 
gtcctcgccg aacgctgctt tgccgaatgg cgcgctgaac tcgttggtac tgctgcgagc 2814 0 
tctgcgggcc gcggatgtgt cggcgccatt gtggttggcg acgtgtggtg gtgtcgcggt 28200 
cggggatgtg ccggtgaacc cggggcaggc gctggtgtgg ggactgggtc gcgtcgtcgg 28260 
tctggagcat ccggcctggt ggggtggcct ggtcgacgtg ccgtgcttgc tcgatgagga 28320 
cgctcgagaa cgcttgtcgg tcgtgttggc aggtcttggc gaggacgaga tcgcggtacg 28380 
tcccggtggt gtgttcgtgc ggcggttgga acgcgctggt gcggcgtcgg gtgccgggtc 2844 0 
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ggtgtggcgt cctcggggga cggtgttggt gacgggtggt acgggcggtt tgggggcgca 28500 
tgttgcccgg tggttggcgg gtgccggggc tgagcatgtg gtgttgacca gccgtcgagg 28560 
cgcggcggct ccgggcgctg gagatttgcg ggcggagctg gaggcgctgg gcgctcgggt 28620 
ttcgatcacg gcctgcgacg tggccgatcg tgacgctttg gccgaagtgt tggcgaccat 28680 
tccggatgat tgcccgctga ccgcggtgat gcatgcggcg ggggtcgttg aagtcggcga 28740 
cgtggcgtcg atgtgtttga ccgacttcgt tggggtgctg tcggcgaagg caggtggtgc 28800 
ggcgaatctc gatgagttgc tcgccgatgt cgagctggat gccttcgtgc tgttctcatc 28860 
cgtctcgggt gtgtggggtg ctggcgggca gggcgcttat gcggcggcga atgcctactt 28920 
ggatgcgttg gcgcagcagc gtcgggcaag ggggttggtg gggactgcgg ttgcgtgggg 28980 
cccgtgggcc ggtgacggaa tggccgcagg tgaaggcggt gcacagctgc gccgggccgg 29040 
cctggtgcca atggctgcgg atcgggcgtt gctggcactt cagggcgcat tggatcgtga 29100 
cgagacatcc ctggtcgtgg ccgatatggc gtgggagagg ttcgccccgg tgttcgccat 29160 
gtcccgtcgg cgtccgctgc tcgacgagct gcccgaagca cagcaggcgt tggcggatgc 29220 
ggagaacacc actgatgctg cggactcggc cgtcccgcta ccgcggctcg cgggcatggc 29280 
agccgccgaa cgccgccgcg cgatgctgga cctggtgctg gcggaggcct cgattgtgtt 29340 
gggacacaac gggtctgacc cagttggtcc cgaccgggcg ttccaggagc tcggatttga 294 00 
ttcgctgatg" gccgtcgaac tgcgcaacag gttggg.cgag gcaacaggat tgagtctgcc 29460 
ggccacgttg atcttcgatt atccgagccc atccgcgctg'gctgagcagc tggtcggcga 29520 
gctggtggga gcgcagcccg cgaccaccgt cgtggccggg gccgatccag tggatgatcc 295 80 
ggttgtcgtg gtcgcgatgg gatgccggta tccgggcgac gtctgctcgc ccgaggagct 29640 
gtggcagctg gtttctgcgg gacgtgatgc ggtatcgacg ttccccgtcg atcggggttg 29700 
ggactgcaac acgttgttcg acccggatcc ggatcgggca ggcagtacct atgtgcgaga 29760 
aggtgccttc ctgaccggtg ctgatcggtt cgacgccggg ttcttcggca tcagccctcg 29820 
cgaggcgcgc gcaatggatc cgcagcagag gttgttgctc gaagtggcgt gggaggtttt 29880 
cgaacgagca ggaatcgctc cgctgtcgtt gcggggtagc aggaccggtg tgttcgcggg 29940 
gaccaatggg caggaccacg gtgcgaaagt ggctgccgcg ccggaggcgg cgggtcacct 30000 
cctgaccgga aacgccgcga gtgtcctggc cggccggctt tcctacacgt tcggccttga 30060 
ggggcctgcg gtggcggtgg ataccgcgtg ttcgtcgtcg ttggtggcgt tgcatttggc 30120 
gtgccagtcg ctgcgttcgg gtgagtgtga tatggcgttg gcaggtggtg tgacggtgat 30180 
gtcgacaccc ctggctttcc tcgagttctc tcgtcagcgc ggtttggcgc cagatggtcg 3024 0 
gtgcaagtcg tttgcggccg ctgcggatgg caccgggtgg ggtgagggtg ccggcctggt 303 00 
gttgctggag cggttgtcgg atgctcgtcg gaatggtcac cgggtgttgg ccgtggttcg 30360 
cgggtctgcg gtgaatcagg atggtgcgtc gaatggcctg actgcgccga atggtccgtc 30420 
gcagcagcgg gtgattcggc aggccctcgc gaatgcgggg ctgtcggcgt ccgatgtgga 304 80 
tgtcgtggag gcgcacggga ccggtaccgg gctcggggat ccgatcgagg cgcaggcgct 30540 
gatcgcgaca tatgggcagg agcgggatcc tgagcgggcc ctgtggctgg ggtcgatcaa 30600 
gtccaacatc ggccacacgc aggcggcggc cggtgtggcg ggggtcatca agatggtgca 3 0660 
ggccatgcgg cacggggagt tgcctgcgac gttgcacgtg gacaagccca ctccacaggt 30720 
ggactggtct gccggggccg ttcggctcct caccgggaac acgccctggc ccgagagcgg 30780 
ccgtcctcgt cgagcggggg tgtcgtcgtt cgggatcagc ggcaccaacg cacacctcat 30840 
cctcgaacaa ccaccgtcgg aaccagcgga gatcgaccaa tcggatcggc gggtcactgc 30900 
gcatccagcg gtgatcccgt ggatgttgtc ggctaggagt ctcgcagcgc tgcaggccca 30960 
agcggctgcg ctgcaggccc ggctggaccg gggtcctggc gcttctccgc tggatttggg 31020 
gtattcactc gcgaccactc gttctgtgct ggacgaacgc gccgtcgtgt ggggtgccga 31080 
tcgggaggca ctgctgtcca ggctggcagc gctcgccgat ggccggacgg cgccgggggt 31140 
gataacgggc tctgcgaatt ccggtggccg catcggattc gttttttccg gtcagggcag 31200 
tcagtggctg gggatgggaa aggcgttgtg cgcggctttc ccggcgttcg cggacgcctt 31260 
cgaggaagcc tgcgacgcgc taagcgcaca cctgggcgcg gacgttcggg gtgtgctgtt 31320 
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cggtgctgat gagcagatgc tcgaccggac gctgtgggcg cagtcgggga tcttcgcggt 31380 
tcaagtcggc ctcctgggat tgctgaggtc gtggggcgtg cggccggccg cggtgctggg 3144 0 
gcactcggtc ggcgagttgg ctgcggcgca cgcggctggt gtgttgtcct tgccggacgc 31500 
tgcacggttg gttgcggctc gggcccacct gatgcaggca ttgcccaccg gcggcgcaat 31560 
gctcgcggtc gccaccagcg aggcggcggt. cggaccgctg ctttccgggg tgtgcgatcg 31620 
ggtcagcatc gctgcgatca acggccccga gtcggtagtg ctctccggcg accgcgatgt 31680 
gctcgtggag ctcgcaggcg aattcgatgc ccgagggctt aggaccaaat ggttgcgggt 31740 
ctcccatgct ttccactcgc accggatgga accgattctg gacgagtacg cggaaaccgc 31800 
caggtgcgtc gagttcggtg aaccggtggt gccgatcgtc tccgccgcga ccggtgcgct 31860 
ggacaccacc ggactgatgt gcgcggccga ctactggacg cgccaagtgc gtgatcctgt 31920 
ccgcttcgga gacggtgtcc gggcgctcgt cggccaaggc gtggacacga tcgtcgagtt 31980 
cggcccggac ggggcgttgt cggccctggt cgagcagtgc ttggccgggt ccgaccaggc 32040 
tgggagggtg gcggcgatcc cgctgatgcg cagggaccgc gatgaggtcg agaccgcggt 32100 
ggcggccctg gcgcacgtgc acgtccgcgg tggtgcggtg gactggtcgg cttgcttcgc 32160 
cggcaccggc gcccgcaccg tcgagttgcc cacctacgcc ttccaacgcc agcggtactg 32220 
gctggccggg caagcggacg ggcgcggcgg cgatgtggtt gccgacccgg tcgacgcgcg 32280 
cttctgggag ttggtcgagc gcgccgatcc ggaaccgttg gtggatgaac tctgcatcga 32340 
ccgggaccag cccttccggg aggtgctgcc cgttctggct tcctggcgcg agaaacaacg 32400 
ccaggaggcc ctcgcggatt cctggcgcta ccaggtgcgc tggaggtccg tcgaggtgcc 324 60 
gtccgcagcc gccctccggg gcgtgtggct ggtggtgctt ccagctgacg tgccccgaga 32520 
tcaaccggcg gtcgtcatcg acgcgctgat cgcgcgcggc gccgaggtcg cggtcctgga 32580 
attgaccgag caggacctcc aacgcagtgc gcttgtggac aaggtgcgcg ccgtcattgc 32 64 0 
ggaccgcacc gaggtgacgg gtgtgttgtc tctgttggcg atggacggca tgccctgcgc 32 700 
ggcgcatccg cacctgtccc gtggtgtcgc cgctaccgtg atcctgacgc aggtgttggg 32760 
cgatgcgggt gtttccgccc cgctgtggct ggccacgacc ggtggcgtcg aggccgggac 3 2 820 
cgaggacggt ccggccgatc cggaccacgg cttgatctgg gggctcggca gggtcgtcgg 32 880 
ccttgaacat ccgcagtggt ggggtggcct gatcgacctt ccggagacac tggacgagac 32940 
gtcccggaac gggttggtgg ccgcactcgc cgggacggcg gccgaagatc agctcgccgt 33000 
gcgttcatcc gggttgttcg ttcgcagagt ggtgcgcgca gcgcggaacc cccggtcaga 33060 
gacatggcgt agccggggaa cggtcctcat cacgggcgga acaggcgcgc tcggtgccga 33120 
ggtcgcacga tggctggccc ggcggggagc tgagcacctg gtgttgatca gtcgccgcgg 33180 
cccggaagct cccggcgcag cggacctagg ggccgagctg actgaactcg gcgtgaaagt 33240 
cacagtcttg gcctgcgatg tgacggaccg cgacgagctg gcggcggtgc tggcggccgt 33300 
tcccacggag tatccgctgt cggcggtcgt gcacaccgcc ggcgtcggga cgcctgcgaa 33360 
cctggccgag acgaccttgg cgcagttcgc cgacgtgttg tcggccaagg tcgtcggcgc 33420 
ggcgaacctg gaccggctgc ttggcgggca accgttggac gccttcgtgc tgttctcctc 334 80 
gatctcggga gtttggggag ccggcggcca aggagcctat tcggccgcca atgcgtatct 33540 
cgatgccctt gccgagcgcc gacgggcttg cgggcggccg gcgacgtgca tcgcctgggg 33600 
tccgtgggcg ggtgcgggca tggccgttca ggaaggtaac gaggcgcatc tccgccgaag 33660 
gggcctggta ccgatggaac cgcagtcggc cctcttcgcg ctgcaacagg ccctgtccca 33720 
acgagaaacc gccatcaccg tcgcagatgt ggactgggag cgattcgccg cctctttcac 337 80 
cgcggcccgc ccgcgaccac tgttggaaga gatcgtggat ctacggcccg acaccgagac 33 840 
cgaggagaag cacggtgccg gcgagctggg gcagcagctg gccgcactgc cgcccgctga 33900 
gcgcggacac ctgctgctgg aggtggtgct ggcggaaacc gccagcaccc tggggcacga 33960 
ttcggcggag gctgtgcaac ccgatcggac cttcgccgaa ctgggcttcg attcgctgac 34020 
cgcggtagag ctgcgcaaca ggttgaacgc ggtgaccggg cttcgcctgc cgccgacgct 34 080 
ggttttcgac cacccgacgc cgctggcgtt gtccgaacag ttggttccgg ccctggtcgc 34140 
ggagccggac aacggcatcg aatcgctgct cgccgagctc gacaggctgg ataccacgtt 34200 
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ggcgcaaggg ccttcgatcc cactggaaga ccaggccaag gtggcggagc gcttgcacgc 34260 
actcctcgcc aagtgggacg gggcgcgtga cggcacggcc agagcgacgt caccccaatc 34320 
gctgacggcg gccacggacg acgaaatctt cgacctcatc gaccggaagt tccggcgctg 34380 
accgcccttt cctcgcctca gctcccctga ttactggaac ggtgtatttc gatggccaat 34440 
gaagaaaagc tccgcgagta cctcaagcgt gtcgtcgtcg aactggaaga r ggcgcacgaa 34500 
cgcctgcacg agttggagcg ccaggagcac gaccccatcg cgatcgtgtc gatgggatgt 34560 
cgttatcccg gtggcgtctc cactccggag gagctgtggc gactggtcgt cgacggagga 34620 
gacgcgatcg cgaacttccc cgaagaccgt ggctggaatc tggacgagct gttcgatcct 34680 
gatccgggcc gagccgggac ctcctacgtc cgcgagggtg gtttcctgcg cggggtcgcg 34740 
gacttcgatg ccgggctctt cgggatcagt ccgcgcgagg cacaggcgat ggacccgcaa 34800 
cagcggttgc tgctggagat ctcgtgggag gtgttcgagc gcgccggcat tgacccgttt 34 860 
tctttgcggg gtaccaagac cggtgtgttc gcgggcctga tctaccacga ctacgcgtcg 34920 
cggtttcgca agacccccgc ggagttcgag ggttacttcg ccaccggcaa cgcgggcagc 34 980 
gtcgcatccg gccgggtggc ttacaccttc gggttagagg gcccggcggt caccgtggac 35040 
accgcctgct cgtcgtccct ggtggcgctg cacctggcct gccagtccct gcggctgggc 35100 
gaatgcgacc tggccctggc cggtggcatt tcggtgatgg ccacgccggg agccttcgtc 35160 
gagttcagcc ggcaacgcgc actcgcctcg gatggccggt gcaagccctt cgcggatgcc 35220 
gccgacggca ccggctgggg cgagggcgcc ggaatgctgc tgctggaacg gctgtcggac 352 80 
gcacgacgaa acggccaccc ggtgctggcg gcggtggtcg gttccgcgat caaccaggac 35340 
gggacgtcca acggcctgac cgcgcccagc ggtcccgcac agcagcgagt gatccgccaa 354 00 
gccctggcga acgccgggtt gtcgcccgcc gaggtcgatg tggtcgaggc gcacggcacg 35460 
ggcacggcct tgggcgaccc gatcgaggcg caggccctga tcgccaccta cggggcgaac 35520 
cggtcggcgg atcatccgct gctgctgggt tccctcaagt cgaacatcgg ccacacccag 35580 
gctgccgccg gtgtggccgg ggtgatcaag tcggtcctgg ccatcaggca ccgggagatg 35640 
ccccgcagcc tgcacatcga ccagccatcg cagcacgtgg actggtcggc gggcgcggtg 35700 
cggctgctca cggacagcgt tgactggccg gatctcggca ggccgcgccg agcaggggtg 35760 
tcctcgttcg gcatgagcgg taccaacgca cacctgatcg tcgaggaagt atccgacgag 35820 
ccggtctcgg gcagtaccga gccgaccggg gcatttccct ggccgctgtc cggcaagacg 35880 
gagacggcat tgcgcgagca ggctgccgag ttgctctccg tagtgaccga gcacccggag 35940 
ccgggactgg gggacgtcgg gtactcgctg gccaccggtc gcgctgcgat ggagcaccgg 36000 
gctgtcgtgg ttgccgacga tcgggactct ttcgtcgccg gactgacggc gttggctgcg 36060 
ggcgttccgg cagccaacgt ggtgcagggc gcggccgact gcaagggaaa ggtcgcgttc 36120 
gtgttccccg gccagggctc gcattggcag gggatggcga gggaactgtc cgaatcctcg 36180 
ccggtgttcc ggcggaagct ggcggaatgc gcggcggcta cggcccctta cgtggactgg 3624 0 
tcgctgctcg gcgtccttcg cggtgatccc gatgcacccg cgctggatcg cgacgacgtg 36300 
attcagctcg cgctgttcgc catgatggtg tcgctggccg aactgtggcg ttcgtgcgga 36360 
gtggagcccg ccgcggtggt cggtcattcc cagggcgaga tcgccgccgc ccatgtggca 3 6420 
ggcgctttgt ccttgactga tgcggtgcgc atcatcgctg cccgctgcga tgcggtgtcg 36480 
gcgctgaccg ggaagggagg catgctcgcg attgccttgc cggaaagcgc ggtggtgaag 36540 
cgaatcgcag gcctgccgga gctgaccgtt gcggcggtca acggacccgg ctccactgtc 36600 
gtttccggcg aaccgtcggc tctggagcgt ctgcagaccg aactgaccgc ggaaaacgtg 36660 
cagacccggc gggtgggaat tgattacgcc tcgcattcgc cgcagatcgc gcaggtccag 36720 
ggccggcttc tggaccggct gggcgaagtc gggtccgaac ctgctgagat cgctttctac 36780 
tcgacggtca ccggcgagcg gacggacacc ggccgactcg acgccgacta ctggtaccag 36840 
aaccttcggc agcccgtccg cttccagcag accgtcgccc ggatggcaga tcagggctat 36900 
cggttcttcg tcgaggtgag cccgcacccg ctgctcaccg ccggaatcca ggaaacgctg 3696 0 
gaagccgcgg acgcgggcgg ggtggtggtc ggttcgctgc ggcgtggcga gggcggctcc 37020 
cggcgctggc tgacttcgct ggccgagtgc caggtgcgcg gactgccggt gaattgggaa 37080 
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caggtattcc tcaacaccgg agcccgacgc gtgccgctgc cgacctaccc gttccagcgg 37140 
cagcggtact ggttggagtc cgccgagtac gacgcgggcg atctcggttc ggtgggcttg 37200 
ctctccgccg agcatcccct gctcggggct gcggtgacgc tggccgatgc gggcgggttc 37260 
ctgctgaccg gcaagctgtc ggtcaagacc cagccctggt tggccgacca cgtggtcggc 37320 
ggggcgatcc tgctgcccgg caccgcgttc gtggaaatgc tgatacgcgc cgcggaccag 37380 
gtcgggtgcg atctgatcga ggagttgtcc ctgacgactc cgctggtttt gcccgcgacc 37440 
ggtgcggtgc aggtgcagat cgcggttggc ggtccggacg aggccgggcg ccgctcggtc 37500 
cgcgtgcatt cctgtcgaga cgacgccgtg ccgcaggact cgtggacctg ccacgcgacc 37560 
ggcacgttga cctccagcga tcaccaggac gccggccagg gccccgatgg gatttggccg 37620 
cccaacgatg ctgtcgcggt tccgctggac agcttctacg cccgcgcagc tgagcggggc 37680 
ttcgatttcg gcccggcgtt ccaggggttg caggcggctt ggaagcgcgg agacgagatc 37740 
ttcgccgagg tcggcctgcc caccgcacac cgcgaagacg ccggcaggtt cggaatccac 37800 
cctgctctgc tggatgcggc actgcaggcg ctgggcgcag ccgaagagga tccggacgag 37860 
ggatggctcc cgttcgcgtg gcaaggtgtg tccctcaaag cgacgggcgc actttccctt 37920 
cgggtgcacc tcgttccggc gggcgcgaat gcggtgtcgg tgttcacgac cgacacgact 37980 
ggccaagccg tgctctccat cgattcgctg gtgctgcgcc agatttcgga caagcagttg 38040 
gcagcggccc gtgcgatgga acacgagtcc ctgttccggg tcgactggaa gcgaatctcg 38100 
cccggcgctg ccaagccggt ctcctgggca gtgatcggca atgacgaact cgcccgagcc 38160 
tgcggctcgg cacttggcac ggaactccac cccgacctga ccgggttggc tgacccgccc 3 8220 
ccggacgtcg tggtggtgcc atgcggtgcg tctcgccagg acttggacgt tgcttccgag 3 8280 
gcacgtgccg cgacacaacg catgcttgac ctgatccagg attggttggc ggcggcgcga 38340 
ttcgccggat ctcgcctggt ggttgtgacg tgtggtgcgg cgtcgacagg tcccgccgag 38400 
ggtgtttccg acctggtgca tgctgcgtcg tggggtttgt tgcgttcggc gcagtcggag 38460 
aacccggacc gattcgtgtt ggtcgatgtg gacggaaccg ccgaatcatg gcgtgcgctc 38520 
gcggcggccg tgcgttccgg agaaccgcag ctggcgttgc gcgccggtga agtccgggtg 38580 
cctcgcctgg cgcgatgtgt tgccgccgag gacagccgga tcccagtgcc cggtgcggat 3864 0 
gggacggtgt tgatttccgg cggtacgggc ctgctgggcg ggttggttgc ccggcatttg 3 8700. 
gtggcggagc gcggtgtccg ccgcctggtg ctcgcggggc gacgcggctg gagcgccccc 38760 
ggggtcaccg acctggtgga tgagttggtg ggcctgggag ctgcggtcga ggtggcgagc 38820 
tgcgatgtcg gggatcgggc ccagttggac cggctgctga cgacgatctc ggcagagttc 38880 
ccgctgcgcg gagtggtgca tgcggccggg gcacttgccg acggggtcgt cgagtcgctg 38940 
acaccagagc acgtggcaaa ggtgttcggc ccgaaggccg ccggtgcgtg gcacctgcac 39000 
gagttgactc ttgatctgga tctctcgttc ttcgtgctct tctcctcgtt ctccggcgtg 3 9060 
gcgggggctg cgggtcaggg aaactacgcg gcggcgaacg cgttcctgga cggcctggct 3 9120 
cagcaccggc ggacggcggg gctgcctgcg gtgtcgctgg cttggggctt gtgggagcag 39180 
cccagcggga tgaccggagc gctcgatgcg gcgggccgta gccgcattgc gcgcaccaat 3924 0 
ccgccgatgt ccgcgccgga cgggttgcgg ctgttcgaga tggcgtttcg cgttccgggc 39300 
gaatcgcttc tggttccggt ccacgtcgac ctgaacgccc tgcgcgctga tgcggccgac 39360 
ggcggtgtgc ctgcgttgtt gcgcgacctg gtgccagcgc ccgtgcggcg gagcgcggtc 39420 
aacgagtcgg cggacgtcaa cggtctggtt ggtcggctgc ggaggctgcc ggacctggat 39480 
caggaaaccc agctgttggg tttggtgcgc gagcatgttt cggcggtgct ggggcattcg 39540 
ggtgcggtcg aggtcggggc cgatcgtgct ttccgggatt tgggttttga ttcgttgtcc 39600 
ggtgtggagt ttcggaaccg gcttggcggg gtgctgggcg ttcggttgcc ggctactgcg 39660 
gtgttcgact atccgacacc gcgggcgttg gttcggttct tgctcgacaa actgattggt 39720 
ggcgtggagg ctccgactcc cgcaccggcg gctgtggcgg cggtgactgc tgacgatccc 39780 
gttgtgatcg tggggatggg ctgtcgttat ccgggtgggg tgtcctcgcc ggaggagctt 3 984 0 
tggcgtttgg tggccggggg cttggatgcg gtggcggagt tcccggacga tcgtggctgg 39900 
gatcaggcgg ggttgttcga tccggatccc gatcgtcttg ggacctcgta tgtgtgtgag 39960 
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ggtggcttcc tgcgagatgc ggcagagttc gatgccggtt tcttcgggat ttccccgcgt 40020 
gaggcgttgg cgatggatcc gcagcagcgg ttgctgctgg aagtcgcttg ggaaaccgtg 40080 
gagcgggcgg ggattgatcc gctttcgttg cgggggagcc ggaccggcgt gttcgcgggg 4 014 0 
ctgatgcacc acgactacgg cgcgcggttc atcacgaggg cgccggaggg tttcgagggt 4 0200 
tatctaggta atggcagcgc gggaggcgtg ttttcgggtc gggttgcgta ttcgtttggt 40260 
ttcgagggtc ctgcggtgac ggtggatacg gcgtgttcgt cgtcgttggt ggcgctgcac 40320 
ctggcgggtc aagcactgcg gtctggtgag tgtgatctgg ctcttgcggg tggtgtgacg 403 80 
gtgatggcca cgccggggat gttcgtggag ttttcgcgtc aacggggctt ggcggcggat 40440 
gggcggtgca agtcgtttgc ggcggctgcg gatggcaccg gttggggaga aggcgcgggc 40500 
ttggtgttgt tggagcggct gtcggatgcc cggcgcaacg ggcacgcggt tctggcggtc 40560^ 
gtgcggggta gcgcggtgaa tcaggatggt gcgtcgaatg gtttgacggc gccgaatggg 40620 
ccctcgcagc agcgggtgat cacgcaggcg ttggcgagtg ctggtttgtc ggtgtctgat 40680 
gtggacgccg tggaggcgca tgggactgga accaggcttg gtgatccgat tgaggcgcag 40740 
gctctgattg ccacttacgg gcaggggcgg gatagcgatc ggccgttgtg gttggggtcg 40800 
gtgaagtcga atattggtca tacgcaggcg gcggcgggtg tcgctggtgt gatcaagatg 40860 
gtgatggcga tgcggcacgg gcagctgccc gcgacgttgc atgtggatga acctacgtcg 40920 
gaagtggatt ggtcggcggg ggatgtccag ctcctcacgg agaacacccc ctggcccggc 40980 
aacagccatc ctcggcgggt gggcgtgtcg tcgttcggga tcagcggcac caacgcacac 41040 
gtcatcctcg aacaagcctc gaaaacacca gacgagactg cggacaagag cggtcccgat 41100 
tcggaatcga ccgtggacct tccagcggtc ccgttgatcg tgtcggggag aacaccggca 41160 
gcgctcagcg ctcaggcgag cgcattgttg tcctatttgg gtgagcgtgg cgatatttcc 41220 
acgctggatg cggcgttttc gttggcttcc tcccgggccg cgttggagga gcgggcggtg 41280 
gtgctgggag cggaccgcga aacgttgttg tccgggttgg aagcgctggc ttccggtcgc 41340 
gaggcttctg gggtggtgtc gggatccccg gtctctggcg gggttgggtt cgtgttcgcc 41400 
ggtcagggcg gacagtggtt ggggatgggc cgggggctct actcggtttt tccggtgttc 414 60 
gctgacgcgt ttgacgaagc atgtgccgga ctggacgcgc atctggggca ggacgtgggg 41520 
gtccgggatg tggtgtttgg ttccgacggg tccttgttgg atcggacgct gtgggcccag 41580 
tcgggtttgt tcgcgttgca ggttggtttg ctgagcctgc tgggttcgtg gggtgtccgg 41640 
ccgggtgtgg tgctgggcca ttcggtcggc gagttcgcgg cggcggttgc ggcgggagtg 417 00 
ttgtcgttgc cggatgcggc tcggatggtg gcgggtcgtg cccggttgat gcaggcgttg 41760 
ccttctggcg gtgccatgtt ggcggtggct gctggtgagg agcagctgcg gccgttgttg 41820 
gccgatcggg ttgatggtgc gggtatcgcc gcggtcaacg ctcctgagtc ggtggtgctc 41880 
tccggcgatc gggaggtgct tgacgacatc gccggcgcgc tggatgggca agggattcgg 41940 
tggcggcggt tgcgggtttc gcatgcgttt cattcgtatc ggatggaccc gatgttgcag 42 000 
gagttcgccg aaatcgcacg cagcgtggac taccggcgtg gcgacctacc ggtcgtgtcg 42060 
acgttgacgg gtgagctcga caccgcaggt gtgatggcta cgccggagta ttgggtgcgt 42120 
caggttcgag agcccgtccg cttcgccgac ggcgtccggg tgctcgcgca gcaaggggtc 42180 
gccacgatct tcgaactcgg ccctgatgcg acgctgtcgg ccctgattcc cgattgtcat 4224 0 
tcgtgggctg atcaggccat gccgattccg atgctgcgta aagaccgtac ggaaaccgaa 423 00 • 
actgtggtcg ccgcggtggc gcgggcgcac acgcgtggtg ttccggtcga atggtcggcg 42360 
tatttcgccg gcaccggggc acggcgggtc gagttgccga cgtatgcctt ccagcggcag 42420 
cggtactggc tggaaacatc ggattacggc gatgtgacgg gtatcggcct ggctgcggcg 42480 
gagcatccgt tgctgggggc cgtggttgcg ctggccgatg gtgatgggat ggtgctgacc 42540 
ggccggttgt cggtggggac gcatccgtgg ctggcccagc atcgcgtgct gggcgaggtc 42600 
gtcgtccccg gcaccgccat cctggagatg gccctgcacg caggggcgcg tctcggctgt 42660 
gaccgggtgg aagagctcac cctggaaaca ccgctggtgg tccccgaacg cgcggcgggt 42720 
gccggtagtc gtggccctgc gggagggacc acagtttcaa ttgaaactgc ggaagaacgt 42780 
gtgcggacga acgacgccat cgaaatccag ctgctggtga acgcacccga cgaaggcggt 42 840 
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cggcgaaggg tgtcgctgta ttcccgcccg 
cgccacgcca ccggcgaact cgtcgtcggc 
tcggctgagg gtgccgagtc gattgctctc 
gggttcgagt' acgggccgtt gttccagggg 
gttctfcgccg aaatcgcccc gccggccgag 
gacccagcgt tgctggatgc cgcgctgcag 
ggcggcgcgt ggctgccgtt ctcattcacc 
atcagcaggg tgcggctgga gaccaggcga 
gagagtgggc ggttgctcgc ctcgatcgat 
cagctggcga atcgggacgc tgtccgcgac 
gcgacgcagt cgacggaacc gggtcgctgg 
gacgatctca tcaaactcgc aacggattcc 
gccgagaaac ttgattccag cgcgctggtt 
caggcggatc ccggcaccgg cgcagccgca 
ctccaagcgt ggttggctga gccgcggttg 
gcagcggtga cgacggctcc gagtgacggt 
gggttgttgc gtgccgcgca ggtggagaac 
ggaaccgccg aatcgtggcg tgcgttgccg 
gccctgcgga agggcgcggt gcgagcgccc 
gtgcccgcgg ttgtggcgga tcccgaccga 
ttggggggcg cggttgcccg ccacctggtg 
acgggccgtc gtggctggga tgctcctgga 
ctcggtgccg tggtcgacgt ggtggcgtgc 
ttgctggcgg cggtcccggc ggaatttccg 
ctggccgacg gggtgatcga gtcgttgtca 
aaggcggcgg gggcgtggaa tctgcacgag 
gcgttgttct cctcgctttc cggtgttgcc 
gcgaacgcgt tcctggacgc attggcgcat 
tcgctggcct ggggcctgtg ggagcagccg 
gaccggagca ggatcgcgcg cgccaacccg 
ttcgatgccg ggctggcgct ggaccgggca 
ttcctggccg agcaggcgcg gtcgggctcg 
cccatccgtc gtaataggcg ggctagcgga 
ggggtggtgc gggagcatgc cgcggccgtg 
gtcgagcgcg ctttccggga tctgggtttt 
cgccttgccg gggtgctggg ggtgcggttg 
ccgagggcgc tggcccggtt cctgcaccag 
gcgccggtga cgacgaccag ggcaccggtc 
atgggatgcc gttttcccgg tcaggtgtcc 
gggggcgtgg atgcggtcgc ggacttccca 
ttcgatccgg acccggaacg ggctgggaag 
gacgccgatc ggttcgatgc gggtttcttc 
gatccgcagc aacggctgtt gctggagctg 
gatccgggtt cgctgagggg gagtcggacc 
tatggcgccc ggttcgccag ccgagccccg 
agtgctggga gtgtcgcgtc gggccggatt 
gtgacggtgg atactgcgtg ttcgtcgtcg 
ttgcgttccg gcgaatgcga tctcgccctt 



gccggtgggt cgagaggtgg gggttggacg 42900 
accaccggtg gtagggcggt tcctgattgg 42960 
gatgagttct acgtcgctct ggccggaaac 43020 
cttcaggcgg catggcgtcg tggtgacgag 43080 
gccgatgcga tggcgtcggg atacctgctc 43140 
gcgtccgcgc tcggcgaccg cccggagcaa 43200 
ggcgtcgaac tttccgctcc ggcagggacg 43260 
cccgacgcga tatcggtggc cgtgatggat 43320 
tctctcaggc tacgaagcgt gtcgtcggga 433 80 
gcgctgttcg aggtgacctg ggagccggtg 43440 
gccctgcttg gtgatactgc ctgcggtaaa 43500 
gccgaccgct gcgcggatct ggcggcgcta 43560 
cctgatgtcg tggtctactg cgccggagaa 43620 
cttgcggaga cccagcagac gttggctctg 43680 
gccgaggcac gtctggtggt ggtgacgtgt 43740 
gcatcagagc tggcacatgc gccgttgtgg 43800 
ccggggcagt ttgtgctggc ggacgtcgac 43860 
agtgcgttgg gctcgatgga accgcagttg 43920 
cgcttggctt cggtcgccgg gcagatcgac 43980 
accgtgctga tttcgggcgg cacgggcctg 44040 
accgaacgcg "gtgtccgccg attggtgttg 44100 
atcaccgagt tggtgggtga gctgaacggc 44160 
gacgtcgcgg atcgtgctga tctggagtcg 44220 
ttgtgcggcg tggtgcatgc cgcgggggcg 44280 
ccggacgacg tgggagcggt gttcggcccg 4434 0 
ctgactcgtg atacggacct gtcgttcttc 44400 
ggcgctcctg gtcagggcaa ttatgcggcg 44460 
taccggcggt cacagggact gcctgcggtg 44520 
agcgggatga cggagacgct cagcgaggtc 44580 
ccgttgtcca ccaaggaggg attgcggctg 44640 
gcggtagttc cggcgaagtt ggacaggact 44700 
ctgcccgcat tgttgacggc actggtaccc 44760 
accgagctcg cggacgaggg caccctgctc 44820 
ctggggtatt cgagcgcggc tgacgtcggg 44880 
gattcgttgt ctggtgtgga gttgcggaac 44940 
ccggcgactg cggtgttcga ctatccgacg 45000 
gaactggcag acgagatcgc tacgacgcca 45060 
gccgaagacg atctcgtcgc gatagtcggg 45120 
tcgccggagg agctctggcg tttggtggcc 45180 
gccgatcgcg gctgggatct ggcaggcttg 45240 
acctacgtgc gggaaggggc cttcctcacc 45300 
gggatttccc cgcgtgaggc gttggcgatg 45360 
tcctgggagg ccattgaacg ggcagggatc 45420 
ggtgtgttcg cggggctgat gtaccacgac 45480 
gaaggtttcg aggggtatct cggcaatggc 45540 
gcgtactcgt ttggtttcga gggtcctgcg 45600 
ttggtggcgt tgcatttggc gggtcagtcg 45660 
gccggtggtg tgacggtgat gtcgacgccc 45720 
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gggacgtttg tggaattctc ccgtcagcgg ggcctggcac cggacgggcg gtgcaagtcg 45780 
ttcgcggaga gcgcggacgg taccggttgg ggtgagggtg ctggtttggt gttgttggag 45840 
cggttgtcgg atgctcggcg gaatgggcat cgggtgttgg cggtggttcg tgggtcggcg 45900 
gtgaatcagg atggtgcgtc gaatggcttg accgcgccga atggtccctc gcagcagcgg 45960 
gtcatccagc a§gcgttggc gagtgcgggt ctgtcggtgt ccgatgtgga tgccgtggag 46020 
gcgcatggga ccgggaccag gttgggtgat ccgattgagg cgcaggctct gattgctacg 46080 
tatgggcgcg atcgtgatcc cggtcggccg ttgtggttgg ggtcggtgaa gtccaacatc 46140 
ggtcatacgc aggcggcggc gggtgttgcc ggtgtgatca agatggtgat ggcgatgcgg 462 00 
cacgggcaac ttccgcgcac gctgcacgtg gatgcaccct cctcgcaggt ggattggtcg 46260 
gcggggaggg tccagctcct gacggagaac acgccctggc ccgacagtgg tcgcccctgt 46320 
cgggtggggg tgtcgtcgtt cgggatcagc ggcaccaacg cgcacgtcat cctggaacag 46380 
tccacggggc agatggatca ggcagcggag ccggattcga gtcctgttct ggatgttccg 46440 
gtggtgccgt gggtggtgtc gggcaaaaca cccgaagcgc tatccgccca ggcggcaacg 46500 
ttggcgacct atttggacca aaatgttgat gtctcccctc tggacgttgg gatttcgctt 46560 
gcggtgaccc gttcggcgct ggatgagcgg gcggtggtgc tggggtcgga tcgtgacacg 46620 
ttgttgtctg gcctgaatgc gctggctgcc ggtcatgagg ctgctggcgt ggttacggga 46680 
cctgtcggga ttggtggccg gaccgggttt gtgttcgccg gtcaaggcgg tcagtggttg 4674 0 
gggatgggcc gccggttgta ctcggagttt ccggcgttcg ccggtgcttt cgacgaagca 46800 
tgcgccgagc tcgatgcgaa cctggggagg gaagtcgggg ttcgggatgt ggtgttcggc 46 860 
tccgacgagt ccttgctgga tcggactttg tgggcgcagt cgggtttgtt cgcgttgcag 46920 
gtcggtctct gggaattgtt gggtacgtgg ggtgttcggc ccagcgtagt gctggggcat 46980 
tcggtcgggg agctagccgc ggcgttcgcc gcaggtgtgc tgtcgatggc ggaggcggct 47040 
cggctggtgg cgggtcgtgc gcggttgatg caggcgttgc cttctggcgg tgccatgctg 47100 
gcggtgtccg cgaccgaggc ccgagtcggc ccgctgctcg atggggtgcg ggatcgtgtt 47160 
ggtgtcgcag cggttaacgc tccggggtcg gtggtgcttt ccggtgaccg ggatgtgctc 47220 
gatggcattg ccggtcggct ggacgggcaa ggtatccggt cgaggtggtt gcgggtttcg 47280 
cacgcgtttc 'attcgcatcg gatggatccg atgctggcgg agttcgccga gctcgcacgg 47340 
agcgtggact accggtctcc acggctgccg attgtctcga cgctgaccgg aaacctcgat 47400 
gacgtgggcg tgatggctac gccggagtat tgggtgcgcc aggtgcgaga gcccgtccgc 47460 
ttcgccgacg gtgtccaggc gcttgtggac caaggcgtcg acacgattgt ggaactcggt 47520 
ccggacgggg cgttgtcgag cttggttcaa gagtgtgtgg cggagtccgg gcgggcgacg 47580 
gggattccgt tggtgcggag agaccgtgat gaggtccgaa cggtgctgga cgctttggcg 47640 
cagacccaca ctcgtggtgg cgcggtggac tgggggtcat ttttcgctgg tacgagggca 47700 
acgcaagtcg accttcccac gtatgccttc caacgacagc ggtactggct ggagccatcg 47 7 60 
gattccggtg atgtgaccgg tgttggcctg accggggcgg agcatccgct gttgggtgcc 47820 
gtggtgccgg tcgcgggcgg cgatgaggtg ctgctgaccg gcaggctgtc ggtggggacg 47880 
catccgtggc tggcggaaca ccgcgtgctg ggcgaagtcg tcgtccccgg caccgcgttg 4 7940 
ctggagatgg cgtggcgggc cggtagccag gtcggttgtg aacgtgtgga ggagctcacc 48 000 
ttggaggcac cgctggtcct gccggagcgg ggcgctgcgg cggtgcagtt ggcggtgggg 48060 
gctccggatg aggccggccg gcgcagtttg cagctctatt cccgaggcgc tgatgaagac 48120 
ggcgactggc ggcggattgc ctccgggctg ttggcccagg ccaatgcggt gccgccggcg 48180 
gattcgacgg catggccgcc ggacggcgcc gggcaggtcg atc.tggcgga gttctacgag 4 824 0 
cgcctcgccg agcgcggctt gacctacggt ccggtattcc aagggctccg cgccgcatgg 48300 
cggcacggcg acgatatctt cgccgaattg gccgggtcac cagacgcctc gggtttcggc 4 8360 
atccacccgg cgctgctgga cgctgcactg cacgcgatgg cgcttggtgc ttcgcccgac 4 8420 
tcggaagcgc gtctgccgtt ttcctggcgt ggcgcccagc tgtaccgcgc tgaaggagca 4 8480 
gcgcttcggg tacggctctc gccgctgggc tccggtgcag tctcattgac gttggtggat 48540 
gccacagggc gacgagtcgc tgcggtggaa tcgctttcga cgcgaccggt ctccaccgac 48600 
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cagatcggtg ccggtcgcgg cgatcaagag cggctgctgc acgtcgagtg ggtaaggtcg 48660 
gctgaatctg cggggatgtc tctgacctcc tgcgcggtgg tcggtttggg cgaaccggag 4 8720 
tggcacgctg cgctgaagac cactggtgtc caagtcgagt cccatgcgga ccttgcttcg 4 8780 
ttggccaccg aggttgccaa gcggggttca gctcctggtg cggtcatcgt cccgtgcccg 4 8840 
cgaccccgag cgatgcagga gctgccgacc gccgcgcgaa gggcgacgca acaggcgatg 4 8900 
gcgatgctgc agcaatggct tgccgatgac cggttcgtca gtacgcgcct gatcctgctg 48960 
acgcatcggg cggtctccgc agttgctgga gaagacgtgc tcgacctggt acacgcgccg 49020 
ctgtggggct tggtccgcag cgcgcaagcg gagcacccgg accgattcgc cttgatcgat 49080 
atggacgacg agcgagcatc gcagacggca ctcgccgaag cgctgactgc gggagaagcg 4914 0 
cagctcgcgg tgcggtcggg agttgtgctg gcgccccgcc tcggccaggt gaaggtgagt 49200 
ggaggtgaag cgttcaggtg ggatgaaggc accgtgctgg tcaccggcgg aaccggcggg 49260 
ctcggggccc tgctcgcacg ccatctggtc agcgcccacg gtgtgcggca cctgttgctc 49320 
gcaagtcgcc-gtggtctggc ggcgcccgga gcggatgagc tggtggccga gctggagcag 4 93 80 
gccggcgccg acgtcgcggt cgtcgcgtgc gactcggcag atcgggactc gcttgcgcgg 4944 0 
ctggtggcgt cggtgcctgc ggaaaacccg ttgcgggtgg tggtgcacgc cgccggtgtg 49500 
ctggatgacg gtgtgctgat gtcgatgtcg ccggagcgct tggacgcggt gttgcggccc 49560 
aaagtggatg ccgcgtggta cctgcacgag ctgactcggg aactcggtct gtcggcgttc 49620 
gtgttgttct cctcggtcgc gggcctgttc ggcggtgcgg ggcagagcaa ttacgctgcc 49680 
ggcaacgctt tcctggatgc cttggcgcat tgccggcagg cccaggggct gcccgcgctg 49740 
tcgctggcct ccgggctgtg ggcgagtatc gatggaatgg cgggcgacct cgctgcggca 49800 
gatgtggagc ggctgtcgcg ggcaggcatt ggcccgcttt cggcaccggg agggctggcc 49860 
ttgttcgacg ctgccgttgg ctcggacgaa ccgttgctgg caccggtgcg actggatgtc 49920 
gaagcactgc gtgtgcaggc ccgatccgtg cagacccgga ttccggaaat gctgcatggc 49980 
atggcaatgg ggccaagccg 50000 



<210> 3 
<211> 25360 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 3 

ccgcactccg ttcacttcca gggttgagcc 
gggcgaacgt cggcagcaag tgctccagcg 
gcacggcagg tcgagcgatg tggacatcga 
gctgacggcc atcgaactcc gcaaccgtct 
gacgctggcc ttcgaccacg gcactgcggc 
aggcaccgcg accgcgccgg caccgaggcg 
gaggtcgctc ttccaacagg cgtatgcggc 
gaaggtcgct gcccagttgc gaccggtgtt 
gaaacccgtc cagctttccc gtggtcccga 
gatcgggatg ccgcccgcac agcagtacgc 
ggacgtttcg gtgatcccga tgcctggatt 
cgaggtggcg gttcggacgc aggcggaggc 
cgtactggtc gggcattcct ccgggggctg 
gcgtcgcggg gtcgtcccgg ccggggtcgt 
cacgccgagg ttctccgtgg cgatggccca 
ggacatgcag gatgtcggta tcaccgcgat 



gttgcacgaa cggctggccg gattgtcgga 60 
cgtccgcgcc gatatcgcgg tggtactggg 120 
gaagcctttg gccgagctgg gtttcgactc 180 
cgctaccgcc accggactgc ggcttcccgc 24 0 
ggcactcgcc cagcacgtgt gcgcgcagct 3 00 
aaccgacgac aacgacgcca cggagcccgt 360 
tggccggata cttgacggga tggatttggt 420 
cggttcgcct ggcgagctgg aatccctgcc 4 80 
agagcttgcc ttggtgtgca tgccggcgct 54 0 
gcggatcgcc gccgggttcc gcgatgtgcg 600 
cattgcggga gaaccgctgc cgtccgccat 660 
ggtgctgcag gaattcgccg ggggctcgtt 720 
gctggcgcac gaggtagccg gtgagctgga 780 
actgctggac acctacatcc ccggtgagat 840 
ccggacgtat gagaagctcg cgactttcac 900 
gggcgggtac ttccggatgt tcaccgagtg 960 
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gactccgacg ccgatGggtg ctccgacgct 
ccctgaaggg cggccgtgga cagatgactc 
cacggtccag gtgccgggcg accacttctc 
acaggcagtc gcgagttggc ttgacaaact 
ggcgtccttt taggaccttc tgggcggcac 
tccaggctcg ccgatcttga cggcgcacga 
ctgccgctcg tggccatcgg cctggcgaat 
cgtccgagcc cgcattccga ggtgaggggc 
cccctcacct cgaaattcgt ccgatttggg 
gtctttcttg ttgacagagc ggtgagaagc 
atgatgctca agcgccaccg tttgacgacc 
ctggtcagcg gctgcggaac cgccgccgca 
gcgcgcaatg ttggtatggc ctcgggcggg 
tcggaggccg atttcctcgc caccgcgaca 
gtggcgtacg ggaaccggtc ggacaagacc 
aagggcgtgg acatgagcaa ctcgccgatc 
cccgacgcgc ccaaggaatt caccctccag 
atggtcctgg ccgacagcgg cgacccgaac 
ctgccggaca tgtccgaggc ccagccggtc 
gccgcgaagt acctgcgagt cagctcgctg 
ccatgtgagt cccgcagatt cgacctcgcc 
tttgtcgttg tgaccagcgt tgttcacgtc 
ctcgggcgcc tccaacgggg cctggcatcc 
ttctctgtcg ggcgttgtca cgccgccggc 
tggtccagcg cggtgtggcg gcggccggag 
tttttcgaac cgacgaggac cacgaccttc 
ttgcgctgca gcatcatcca ggcgcttgtg 
cttccctgcg aacgatggat ggtcaccgcg 
aagtcgtagt cgatgtcctc gtcctcgtcg 
tcgagggcgg acacgacgcc ctgcgtgccg 
ttgttgcgga tctgcgtgac cttgtcgccg 
tctggcaggc cctccctggc cggggtgatc 
gcacctgcgg ggcctcgatg catcggggcg 
cggaacttcc gcggaatccg gcgggcgacg 
ctttcctcta cgtggaacag gaagaagtcg 
gcgttgattc ggtgcgcgtt ggtcaccacc 
ttgagccgca cgtgtggaat cggggtgcca 
gctccgaccg acgggagctg gtcgacgtcg 
atcgccttgg ccagtttgtt ggctaacagc 
acgaggtcgg cgtccagcgg gttgtcccgg 
tggagcaggc ggtgcacggt cgccgcgtcg 
gctcgtcccg tcggcgcggc gaggatcacc 
atcgaccgca cggtgaagct cttgccgcag 
tcggtcaggg ccagcttgac ggcgcgctcc 
cggcgcaacc agtcgagggc cttgtgccaa 
gcgctggtgt tcagcagccg ggacagctgg 
ggcacgaggt- agatcgcgac cgtcggcacc 
accacacctt cctcggtgac gagttcggcg 



gttcgtgcgg accgaagatt gcgtcgcaga 1020 
ctggcggcca gggtggactc tcgcggatgc 1080 
gatgatggac. gagcacgccg ggtccaccgc 114 0 
caaccagcgc "accgctcggc aacgctgacg 1200 
cggccacccc ggcggtgccg ccttccgtgg 1260 
tgcgcggcac gcgcgctgat cgtgattccg 1320 
catgtccttt cgggcaacgt caaacgaatt 1380 
acccttgggt ggctgagccg ctcaagggtg 1440 
cggtggacgc aaccccggtg ggcgtggtgc 1500 
cgctgacaca cctgagagga aaaggggagc 1560 
gccatcaccg gccttctggg gggagtactg 1620 
cttcagtcct cgccggcgcc cgggcatgac 1680 
ggcggcgggg acatcggcac gtcgaactgc 1740 
ccggtgaaag gcgaccccgg cagtttcatc 1800 
tgcacgatca acggcggcgt gccgaacctc 1860 
gaggacctgc cggtcgagga cgtgcggctt 1920 
cccggtcaga gcgcgtacgc cggcattggc 1980 
gcccatgtcc tcaccgggtt ccagtcctcg 2040 
aacgttctcg gcgacggcaa cgtgaagttc 2100 
gtgtctaccg cagacgagct gcgctaaaac 2160 
gtgcggcgcc tccggcgaag cgtccgtacg 2220 
cgggcgcagc gctggtacat actcaggcgt 2280 
ggggccgtcg agtgcggcgg cgctgacgcg 2340 
ctcgaaccgg tcccgccccg tcggagccgg 24 00 
ccgacggtgc gcaccgcctg cccgagggcc 24 60 
ttggcccggg tgaccgccgt gtagagcagg 2520 
gtcaagggga tcaccacgca cgggtattcg 2580 
taggcgtgga ccagttcgtc gagttctgtg 2640 
gttcgcacgg tcatggtctg tgcttcgttg 2700 
ttgaacacgc cgttggcgcc cttgtcgtag 2760 
acgcggaaga tccgtccgcc gaaccgccgc 2820 
gcttcctgca acagctggtt cagcgcgcct 2880 
aggacctgca cgtcggtgcg cgggttgaac 2940 
acgtcgacgg tgagctcggc ggtcggttcg 3000 
gtcagcccgt gtgtcagcgg atagtccccg 3 060 
ccggactcgg cggcctgccg gaacacctcg 3120 
ggggcgagca gatcgcgcag tacctcaccg 3180 
ccgaccagca gcaggtgcgc gccgggcgcg 324 0 
aggtcgagca tggacgcctc gtcgaccacg 3300 
tcgtaggcgg cgtccccgcc cggctggagt 3360 
tgtccggtga gctcggtcag ccgcttcgcc 34 20 
ttggcctttt tcgcctgagc taatgcgatg 3480 
cctggacctc cggtgagcac ggcgaccttc 354 0 
tgcgcctcgg cgagttcggc accggtagcg 3600 
tcgacgtcgg cgaagacggg catccggtcc 3660 
ttggccaggg cgacttcggc gcggtggaag 3720 
tcgtcgtcat cggtggggat ctcctcgcgg 3780 
aggcattcga tcaccagccc ggtgtcgacg 384 0 



WO 01/16303 PCT/EPOO/08013 



gcgaggatct tcaccgcctc ggcgatcagc 
ccggtggact ccgacagcgt gaactgaagg 
gggattccca ccgctttggc gatggtgtcg 
cctgccagcc ggtatggctc ttccttgacg 
tagatcttca ccgccagcga ggtcgagacg 
ttgatcgcct tctgctcctc ccacgcgtcg 
ttggggacct cgatcagccg cgcgggttcc 
aagtggtcga cgatcttctc ggcgagtttg 
aggtagcggc ggataccttg cacggtcgca 
aactgccgcc cgtactgggg gtgcgacccc 
ggctgcgcgc ccagcagcgc gccgacgacc 
atccgcgcga cggtgtagcc gctctcctcg 
tccagcaccg cagtccacgt ggccgactcc 
gaacggctgt caagcaaacc ggcggtcacc 
gctccggcgt cagcgggtgg gcgttcgcga 
■ ttttttcgtc ctcgcgcagg gcctggagta 
gccgggataa accggcttga cggaggagat 
ccacgacgac cccgagtcat gcgtcgacga 
gggcacgtgg gggcgtgcga tcaggtcgcg 
cagcagccgc gacggactgg ttccgctcct 
agctcgatgc cggccgcgtt cgcgaggtgg 
agtctgcctt ttgaacatct ggtcaattca 
ctgcccgcca attgtttgct cgccgtgcgc 
gcgtgctggc gaaggtgtcg attggctcgg 
tcgccacgag cgtcggcgat gcccggtact 
tccacgggcg cggtgcgctc ctgatcatga 
gccgtcgtgg ggcactgtga ggatgtgcat 
cctgatcgcg ggctgccgcg gctgtgccca 
gagcgtcagc ccctgcgcgt cgagtgcgac 
ctgctcggtt gccggtgcgt tcgcctgcac 
cacatcggtg acgaccgagc cattcgcggt 
caggaactcc gcctgctgcg gtccgttcgc 
ccagcaggcg tggtcgttgt tgttcatcga 
caggaacgtc agaccggtgc ggcccagcga 
gaggtagaac tgcggcctgc gtaggaacag 
ggtgaagtgc gcctgttggt atggaagttt 
cggatagcgc tcgccgtaga tcgcgtgggt 
gagtgcgggc accggtgccg ggtcctcggc 
gcggctggac aggaacgcgc ccaagttcgg 
ggtacgttcc gggtccggtc gcacgtcggc 
ggcgacgttg gcgaaccagc cggagccgtc 
gtcggtgtac ttccgcgcca tcgtgacgag 
ggccatctcc ggcagcatca cctcgaagtt 
gaagtagccc gaaggactct gcccgttcgc 
ctgccgcagc gcggggtcgg ' ggtcctggtt 
ggtaacctgg ttcgcgtagt ggattgtggg 
gagccacgtc atcgctttgc gcagcgccga 
gttcgccgcg cgcagcaggg cgttggtctt 



tcgttctccg gcaggtagca gttgccgtcg 3900 
cccgccttta cccgctgcgg ggagtcgtgc 3960 
gcggtcttga aaccgattcc ccacacgtcg 4020 
gtccggatcg cgtcgtcgtg gtactgcttg 4080 
ccgacgcctt gcaggaagat catcacctcc 4140 
gcgatcagct tcgtccgctt cgggccgagc 4200 
tgctcgatga cgtcgagcgc ggcgacgccg 4260 
gggccgatgc ccttgatcag gccagacccc 4320 
ggcagcacgg tcgtgtagtc gtcgacgtgg 4380 
caccggccgc gcatgcgcaa cgcctcgccg 4440 
gtcaccaggt caccgccccg gccggtgtcg 4500 
ttggcgaacg tgatccgctc cagcgtgccc 4560 
cgtccttttt ccaccgacaa cacgtatcac 4620 
acatgcagcg gcatctcccg aacgcctcgg 4680 
tgccttggtg cggccggtgg gagttgtaga 474 0 
ggtgccgctg gctccagatc aggggaaagc 4800 
ggaagagctc tacgtcgaat ggccagcgac 4860 
cctgcgaggc cgtggcgaag cgttgacagg 4920 
gcgccggctc agcgccgtgt ccggacgcac 4980 
cgagcgcggt gtcctcgatc gtctttttcg 5040 
ccagtccagt cggtcggcag tcgtggtcgc 5100 
gctttgaatg atcgttcggc gagtattgtc 5160 
cctgggacag gagcgcggcg ttcggggacg 522 0 
acggccggac caggatcgcc gacgtcgtga 5280 
atctggagca ccagcgagtg gacaacgact 534 0 
atcaagaact gagaccgtca gggtgaacgt 54 00 
cacacggccg gggtaggtga tcgaggaggc 5460 
gtcgaatcgc agcaccacct ggccgcgggt 552 0 
tttgtcggag tccctcacca ccaacgggat 5580 
cgacctggtg atccggtcgg gccgcacggt 564 0 
gcggtagcgg aaccggtatg gctcgccgcc 5700 
gtccggcacc tggcccggga agaccgtcgc 5760 
ctggacgacc gtgcccgcga ccgggtgcca 5820 
ggtcgcgcgc ctcgtgccga agtagccgcc 5880 
gaagtcttgc ccctggtcgc tgcggatctc 5940 
cgcgatcgcg gccgtgcgct gggcgcgggt 6000 
gaggatgcgc ggtgaggtgt cctgtttgac 6060 
ccaggccgct cgtgcggcgg cgaggtcttc 612 0 
caccactggc acgaactggt tgttcagcgc 6180 
gtagaaccgc gccgaggtgc gggagttggg 6240 
gggttcgcgc agcaggttgt agctgagcca 6300 
gttggcgttg ttggactggc gccacgcgtc 6360 
gtagttggtg tccgcgccgg tcaactcgta 6420 
ggcaaggcgc gcgaacgcct cggtcaactg 6480 
cagcgcgagc gaggagccgg cgagcccggc 6540 
ctgccaggcg ctaccgttgc ccgggttgag 6600 
ggtgatctgc gactggcgtt gcggcaggat 6660 
gctcaggtac cccaacccga agccggtggc 6720 
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ggcgagcccg tgctcggtcg gcgaatactc cggccacgag ccgtcatcgt gctgcaaccc 6780 
taggtagtgt ccgagcccgg cgtcgagcgc ggcgagcaga gtcgcgtcgc cccggtacgg 6840 
gttccacgtg cgggattgcg tggcgaacca ggcgagcgtg tagacgtgct cctggacgcg 6900 
ggcgttgtag gacaccgccg ggctgcgcca ccagcctccg gcgaagaagc cgctcgagtc 6960 
catgtcggcg accatcggcg agaccgcggt gaggtacgac gcgaatcgct gctcctcggg 7020 
tgcgaacagg cgccggttcg gtgtggtggg cggcggggcg ggcagtgcga gtgcgcgggt 7080 
cggcagtgcg gcgagcagtc cgagagcggc ggcgccggtc atcaggctgc gacggctgaa 7140 
cgtagtcacg ggcctacctc cttgtggccg atcaaccctc acccgctgcg tagccgcacg 7200 
tcaagatgat aattcgaatt attatgggct tgacgacgcg taggccgacg acgcagaatt 7260 
cctgccaatt cgtattggca agcgggggtg ctcgtggccc gacggctcac ccagcaggac 732 0 
atcgcccgga tggcaggagt cagccaggcc acggtgtcgc tggtgctcaa caaccggaag 7380 
gacggcaacg tccggatcgc ggcggagacc cgtgcgcagg tactggaggt gattcggaag 744 0 
accggctacg tcgcgaaccc gatcgcccgc aggatgcgcg atcggcacaa ccgcatcctc 7500 
ggcgtgttca cctacgaggc ggtgttcccg agcacccacg cgaacttcta ccagtcgttc 7560 
ctcgaaggca tcgaggaaca ggccgaggag gtcggctgtg acctgttgtt gttcaccagc 7620 
gccaaggcca ccggggagcg gcggcggatt ttcggcgacg acagccgggt gcggctcgcc 7680 
gacggcagtc tgctgctcgg tcgcacggtc gaccgcgacg acctgaccca gctgctcgcc 774 0 
gaaagcatcc cgtacgtctc catcggacga cgcgacgacg cgggcggtcc ggtgccgcac 7800 
gtcggggccg actaccgcac cgcggtgcga gacctggtgg accgcgcggt cgcgctcggc 7860 
caccgcgggt tcgcgtacgt ggggtctggt gggggcgcgg agtcgtccgc ggatcggctg 7920 
cgaggcttcc gcgaagccgt tgccgcacat ggcgtccaag ggatgcatgt ggagacccca 7980 
cagc.tcgatc agctgcgcga agcgggcgtc accgctgtgc tcaccgaaga ggtgtcggac 804 0 
ggggccgcgc tcgtgctcgc cgggcgcgaa cgcgggctct ccgtgccggg cgacctcgcc 8100 
gtgctctcgc tcggtgccgc tacccggtcg gcaccggacg acgacgtggc gctcaccggt 8160 
ttccgcatcc ccaggcgcga gatggggcgc cgggcggtgc aggcgctgac cgaggtgctc 822 0 
gaaaacggca ccacaccgca agaactgctc ccgtgcgagt tcgtcgaggg ctcgacgctg 8280 
ggcgcaccac gcctttgacc aggaggaact gttgctcgac cacaccacgg acgttgtcgt 834 0 
cgttggcggc ggactcggcg gtgtcgccgc cgcactcgcg ttgctgcgcg cgggccgccg 84 00 
ggtcgtgctc accgaggagt acgactggct cggcggccag ctgaccagcc aggccgtgcc 8460 
gcccgacgag cacagctggg tggagcgctt cggcgtcacc gcgagctacc gggcgttacg 852 0 
cgacggcatc cgcgactact accgccgcca ctacccgctg accccgcgcg cacgggcgtg 8580 
gcgggagctc aaccccggtg cgggcaacgt gagcaggctc tgccacgagc cccgcgtcgc 8640 
cgtcgcggtg atcgacgaga tgctggcgcc gttccgcggc agtggcaggc tgaccgtgct 8700 
gcagccgtac cggccggtgg ccgcgcacaa cgacggcgac cggatcgtgt cggtgaccgt 8760 
tgcgcaccgc gacaccggtg aacagatcga gctctccgcg ccgtacatcc tggacgcgac 882 0 
ggagacgggt gaactgcttc cgttgtccag cacggagtac gtcaccggct tcgagtccac 8880 
tctggacacc ggcgagccga gtgcgcccga cgtcgcgcag ccggcgaaca tgcaggcggt 894 0 
gtcggtgtgc ttcgtggtcg accacgtcga cggcgaccac accatcgaca aaccggcgcg 9000 
gtacgacttc tggcgcgcgt accagccgga cttctggggc gaccggatgc tgtcgttccg 9060 
ctcccccaac ccgcgcacgc tcgcgatctc cgaacgtacg ttcaccccga acccggacga 9120 
cgacccgctc gg'cgtcgtgt cggaccagcg gctcagtgcc ggtgacagca atctgtggac 9180 
gttccggcgc atcgccgcgc gtcgcaactt cgtcgagggt gcctacgaca gcgacatctg 9240 
cctggtgaac tggccgatca tcgactactt cgagtcgccg gtgatcgacg tgccggacgc 9300 
cgacgcgcac atcgccgcgg cgcgggaact ctcacgttcg gtgctctact ggctacagac 9360 
cgaggcgccg cgcccagacg gcggcaccgg cttccccggc ctccgcctgc gcggcgacgt 9420 
caccggcagc gcggacggtc tcgcgcaggc gccgtacatc cgcgagtcca ggcgcatcag 94 80 
ggccgagcac acgatcgtcg aacaggacct ctcgctcgcc gtgcgcggcg acaagggtgc 954 0 
ggtgcagcac gccgacgccg tgggtgtcgg catgtaccgt atcgacctgc acccctccac 9600 
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cggtggcgac aactacatcg acgtcgcgag ctgcccgttc gagatcccgc tcggcgcgct 9660 
gatcccgcaa cgggtggaga acctgctacc cgcgggcaag aacatcggca ccacccacat 972 0 
caccaacggt tcccaccggc tgcacccagt cgagtggaac gtcggcgagg tcgcgggcgc 978 0 
gctcgctgcc ttctgcctgg cgcaccgagt ' cacccctcgc gcggtgcgca atacccctgg 984 0 
cctgctcgcg gacttccagc agtgtctgga acgcgacggg gtcgagctcc gctggccgga 9900 
cgtgtccggc tactgacgca gggagacgaa aatgacaaag ctgtcacgac gactcacggc 996 0 
actcatcgtc gcagggctgt tcgccctcac cggctgcggt ggtggatcaa ccgcacagtc 10020 
cggaccgaag tcgctgcgca tgaccgtgtg gactgccaac gcggcgcatc tcaagctgct 10080 
caacgacatc gccgccgagt acaaggcctc gcacccggac atcgccgaga tcaagttcga 10140 
ctcggtgccc gccgacggct acaccaccac gctcaccacc cagatcgccg gcggtaacgc 10200 
gccggacctg gcctggatcc tggaggagtc ggcaccggac ttcgtggcgt ccggtgcgct 102 60 
cgccccggtg cgcggcaaga tcgagaaggc cgacgagctc gtgccgtccg -cgacgaagct 10320 
gtgggagaag gacggcgaac tgtacgccta cccgttctcc acctcgccgt tcggcttgtt 103 80 
cgtcaacacc gacctggtga agggcgcctc ggcggactgg acctgggacc aggcgatcgc 10440 
ggctgcctct gcgtcggcgg ccgcctccgg caagggcggc ctggtactgc cggacttcaa 105 00 
gtaccagaac tgggcagtgc tgtcctctat ctggcgcggc tggggagctg atgcgtggag 10560 
cgcggacggt cgctcgtgcg ggttctccag cagcgagatg aacgacgcga tgtccttctt 10620 
gcacaaggcc atcttcaccg acaaggcgat tccgggcccc ggcacgacgg tggacttctt 10680 
cgccggcgac gcggcgatgg cgatcggcca gatctcccgg tccagtgcgt tgaaggacgc 10740 
gaagttcggc tggacgctgc tgccgctgcc ggccggtccg aagggtgact acgcggtgat 10800 
cgggcaggcc gggatcggtg tgctgaagca gtcgcacaac gtcgacgccg cgacggactt 10860 
cctcgccttc ctcaccaacc agaccaactc cgccaagctc gcgcagttct tccccccggc 10920 
gcggtcgtcg ctgctcaacg cggagacgct cgccaagagc aacccgttga tcaaggccga 10980 
gcaactgcag tccgtcgtcg tcgacggcat caacaagggc gtcgtgaagc cgagccacaa 11040 
gggtcaggag gagctgaacc agacgatccg cgccgcgctc gacccgttgt ggaagccgga 11100 
cgcgaacgtg cagaacgtgc tgaacgacgt gtgcaccaag atcaaaccgc tgctggagaa 11160 
caagtgacgg cggtcgcaca ctccacgcgg gccgtagccc gcaaggggac gtcgtactgg 11220 
acgcagcgca ggcgcgacaa tctggttggt tacctgttcg tcgcgcccgc gctgctcggc 11280 
agcatcgcgt tcgtgctggt gccgctggcc ttggtcggct ggtacagcct caacgagtgg 11340 
aacgtgctcg ccggcacgtt cgagttcgtc ggcgcgcaga actaccaaca gctgctcgcc 11400 
gacgagaagc tgcgcgactc gctggtggcg accacttggt tcgccgccgg cctggtcgtg 11460 
ctcaacctgt cactagcgct gctgctggcc gtgctgctca accagaagct gagcggcacc 11520 
acggtcttcc gcacgttgtt cttctctccc gtcctcgtgt cgctggtggc gtggacgctg 11580 
gtgtggcagc tgatactgca gccggagggc agcgtcaacg gactgctcgg gttcttcggt 11640 
gccgacgggc cgaactggct gcgcggtgag tcgaccgcga tggtctcggt gatcgtcgtg 11700 
caggtgctca agaacgtcgg cctgaacatg gtgctgttcc tcgcggcgct gcaaggcgtg 11760 
ccgcagccgt tgtacgaggc ggcgaagatc gacggggcgg gtgcctggac ccggttccgc 11820 
cgcatcacct tgccgttgat cagcccgacg atcctgctca cgtcgatcat caccatcgtc 11880 
ggctcgctgc aggtgttcgc gcagatcgcg gtgctcacgc agggcggtcc gggcacgtcc 11940 
acgaccgtac tgatctacta cctgtaccag caggcgttcc agttccacca cttcggctac 12000 
ggcgcgacga tctcggtact gctgttcgtc atcgtcgccg cactcaccct gctgcagtgg 12060 
cagatgcgcc ggaagtgggt gctgcatgag gcttagggtc aagatcacgc tatacgggct 12120 
gctgtgcctg ctgtgcgtgc ctttcgtgtt cccaacctgg tggatgatca cctcgtcgat 12180 
gaagcccatc agcgagatct tctccacctc gccgctgccg tcggagtgga cgttctccac 12240 
ctaccggcag gtgttcgaga tgcagccgtt cgcgcagcag tactggaaca gcctctacat 12300 
cgcggtgatc gtcacgacag gcacaatggc ggtggcggcg atggccggat acgcgttcgc 12360 
gcgtatccgg ttcccgggcc agaacgtgct gttcgtggtc gtcctgatcg gcctgctcat 12420 
cccgagcgag gtcaccatcg tgccgctgtt caagatgttc cagtcgctcg gcctgaccaa 12480 
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cacgcactgg ccgctaatca tcgtgccgat cttcggggcg ccctgtgtgc tggcgatctt 12540 
catcatgcgc cagttcttca tcgccctgcc gtcggagctg gaggaggccg cgcggatgga 12600 
cggcctcgga cgagcgggca tcttctaccg .gStQgccctg ccgctgtccc ggcccgcgct 12660 
gggcgccgtc gcgatcttca cgttcctgca ctcgtggaac ctctacctgg agccgatcgt 12720 
ctatctgtcc acaccggaca tgtacacgtt gccgcaagcg ctcacgcagt tcgtcgacaa 12780 
ctacggcggc ccgatgtgga acgtgcagct cgccgccgcg accatgaccg cagtgccgat 12840 
tctcgtcgtg ttcgtgctgg cgcagcggca gttcatcgag ggactcgcgc acaccggggt 12900 
taaggggtga gtcccgcgat cgacccggag atcgccgcgc tggtagacct cgccagacca 12960 
ccggtgacgc cagcgggaat cgacgcggtc cgggcgggag gacgcgtcgt caccgatgcc 13 020 
gagctgacgc gcggtggcac cgtgacgttc gccgatgccg atgccgatgg cgtgcccgtg 13080 
cttgtgctgc gcccagcggg tgtgccacgt cttccggtgc tgcacctgca cggcggcggg 13140 
• atggtcgcgg gcacccggcg caccgacctg cacgtgctgg ccgagtgggt gtcggagctg 132 00 
ggtgtcgtgc tggtgtcgcc ggagtaccgg ctcgcacccg agcacccgca cccggcgccg 13260 
tcgcaggact gcttccgggt gctggagtgg atgtcccgca acggcttcgg cccgcctgtc 13320 
gttgcgggta catcggcggg cggcgggctg gcggctgcgg tgacgctgat ggcccgcgac 13380 
ctcggcggtc cgccgatctt gccgtcggcg cgcaactgct cgaacgcgtc cagcgtctcc 13440 
gccaacggcg tgctaccgcg ccagtgcagc aggtagacgt ccaggtgatc ggtgcccaac 13500 
cgccgcagac tgcgttcgca cgcctcgacg gcgccacgcc gacccgcgtt gtgcggatac 13560 
accttgctga ccaggaaaac ctcgtcgcgg cgcccggcga tcgcggcacc gacgacctcc 13620 
tcggcgcccc cgctgccgta catctcggcg gtgtcgatca gccccaggcc gaggtccagg 13680 
ccgcgccgca gcgcggcgac ctccgccgcg cgctgcccac gccgctcccc catgccccag 13740 
gtgccctgcc ccagcaccgg caattcctgc ccgccacgca gttgaaggcc tcgcatgcac 13800 
actccgtttc agtcagcgct gctgcggcag cacggccacc agctcgaagc caccgtcctc 13 860 
ggacggcccg gcggccaacc gcccaccggc catctgcacc cgctccgaca ggccggccag 13920 
cccggacccg ccgctgccga cctcgatggc cggtccccgc cggtcggggg cgccgtttgt 13980 
gacggcgacg tgcagcgagc cctcgcggaa caccagccgc accttgatgt cagcaccggg 14 040 
tgcgtgcttg gaagcgttcg tcaacgcctc ctggacgacc cggtaagccg cgcgctcaac 14100 
cgcaggcgac accccgtgcg ggtcgcggat ctccgaggaa agctccacat ccacaccggc 14160 
ggcgcgcgcc cgcacgacca gctccgggat cgtgctcaac ccggcccgct gcctgggggc 14220 
gccgagcccg agcgtcgcgc gcatctcgtc cagcgcctgc ttcgccaacc cgcgcagccg 14280 
cccggcggtc tccttcgcac cgggatcctc ggtggtggcc gccaacgccg ccgactcgac 14340 
cgcgattaag gtcacgtgat gtcccaccgc gtcgtggatc tcccgcgcga tccgggcccg 144 00 
ctcctccgcg cgcgccgtct cggccttcgc ctgcaactcc gcctcggtgg ccgcctccag 14460 
cttccgcaga ctgtccgcca gctgctcccg caacgccacc agcgccccca atgccgtcgg 14520 
cgcgcctgcc gccagcagca cgaacgccac ggtcaggatg atcgacccga cccccagcga 14580 
ctgcatgatc atcaccggtg ccacggcgac cgccgcggtc agcaacaccc agccgaccaa 14640 
catctggatc cgcggctgcg cgcgccccaa cgcgtacaac gccacgaccg tcggcgccca 14700 
acccaaacca ccggccaacg ctggcatgca cagcagcacc gcgatccgcg gccaccgccg 14760 
caacggcagc agcgccgaag ccaccaaacc ggacaccgtc gcatagccga acgggtcggg 14 820 
cgcggccagc acagctgccg ccggaatccc gacgaccagc gcctcaacgc cgagccgaga 14880 
ccagcgcacc gaactcaccg gaagccgccc gccacgcgct gcgcgatcaa cgcggcctgc 14940 
acccggttct gcgcgccgag cttgctcaac accgtcgaca cgtagctctt gaccgtcgcc 15000 
tcggtcagcc cgagccggac cccgatatcg gcgttggacc gaccatccgc caacaactgg 15060 
agcacctgcc gctcccgccc ggacaacccc tgcaccagct gcgtctcgtg cgcgttgtcg 15120 
gacagactgc ggaacctggg caacaaccgc gcagtgatcc gcggatccag caccgcccca 15180 
ccggccgcca gatcgtgcac cgcccgaacc agcaccgcgg gctccgcgtc cttgagcagg 1524 0 
aacccgtgcg ccccgaaccg caaggcatcg gccacgtaat cgtccaggtc gaacgtcgtc 153 00 
agcaccgccg gcaccggcgg attcgccaac gtcgccaagt cccgcaacgc cgtcagcccg 15360 
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tccttgccgg gcatctgcac gtcgatgagg 
gccagcaact catccccgtt gcccgcctcg 
aacaacaccc gcaggccacg ccgcagcatc 
ggccccccgc cgtcgtcagc ccgcacctct 
cagactagcg gggccgcgaa agcgcagcgc 
cgcccgccct tggccttccg ctcccgcacc 
aacccgaacg tctcccggaa cctgcgctcc 
ccggtggtga acagcacgat cgtcggcggc 
ttcggctgct tgccgccccg caccggcggc 
ccgttcaacc ggccggtgga gatccgggtg 
ggggccagtt tcgccaccgc gcgaccggtc 
cgcacccgca ccagctcgcg gtcgatctcc 
accaggtccc acttgttgta ggcgatcacc 
atcacccgca ggtcctgctc gctcaacggc 
tcggccgcct cgatcgcggc cttggtgcgc 
gtcttcacgc gcttgcgcag cccggcggtg 
tccaccagcg agtccaccgg gtccaccgtg 
tcttccccgg tcagcttgtt cagcaggctc 
gccacccgcc gcggcccacc ggtcgcgccg 
tccaggacgg cgtccagcag gtcaccggaa 
ggctcgccca accccagcga ccacagcgaa 
accttgttgg cggccagcag caccggacgc 
tcctcggtct cggtggcacc gacccgggcg 
tgcatggcca gttccgcctg cgcggccacc 
cagccaccgg tgtcgaccac cgtgaagcgg 
cgatcccggg tgacaccagg ggtgtcctgc 
ttcaccagcg tcgacttgcc cacgttggga 
ttcgtcggct cctcgccctc gaccgcttcg 
tcggaccacg tgccgtccag gccttccacc 
aatagtcttg gctctggtcc gcgaccagcc 
cggtgcgcac cgcctcggtc gccacgacca 
cgatcggctc cccgaacaac acgtcgatcc 
ggcggcgcgt gccccggcag gccaccggca 
acgccgcacc gtgctgcgca ttggtcacat 
ccccgacgag accaccggcc cgcagcaccc 
gcgcaccgcg gcgcaccccg atctgcccga 
ggaacatctc ctgcttgatc aggaacaccg 
gcccgtccat catcgaactg tggttggcca 
gctccgcgcc gtggatccgc acccggtacg 
ggccgcaccg gtgcattcca ccggacgcgc 
cgtccgctcc gccgcgacca gcaggccacg 
cacctccacc acacccaggt ccgtggtgtc 
cggcgccacc ttccgccccg agtccagcgc 
gtccaggtca ccggcccgtc cctcggagac 
gtgcgcggac gccgtcaggt agaccttcaa 
gtcgcgtccc tcgacgacca tgccgcctgg 
ctggtgggcg accaactgct cccgcacctc 
ggtcacctcc ggcccgcgga tctcccgccc 



gcgacgtcga cgcggtacgc gcgcaccgcg 15420 
gcgaccaccc ggatcgagcc gtcgccctcc 154 80 
gcctcgtcat cggcaagcac aacgcggatc 15540 
cggtcacctc cctgttccga gggtcgccag 156,00 
acacgccacc cgagcgcact gtgtcagcgg 15£60 
cgcaccgaga tccgcaccgg gctacccacg 15720 
aggaaccgcc ggtacccggc ctccaagaac 15780 
cgcgactgcg cctgggtggc gaacaggatc 1584 0 
gggttggccg cgaccagctc ggaaagccaa 15900 
tcccacgaat ccagcgccgt gcgcagcgtc 15960 
ttcgccgaca cgttgacccg ctcggcccac 1602 0 
ttctccaact ggtggcggcg gtcctcgtcg 16080 
aatgcccgac cggcctcgac gaccatagtg 16140 
tcggagccgt cgatgagcac gatcaccacc 16200 
agcgacgcgt agtactcggt gccgcttgcg 16260 
tcgacgaacc gccacacctg gccgtccagc 16320 
gtgcccgcga cgtcgtgcac caccgaccgc 16380 
gacttgccca cgttcggctt gccgaccaat 16440 
aagatctcgc gcggcgtctc cgggaacacc 165 00 
ccgcgcccgt gcaacgcgct gaccggcatc 16560 
tgcacgtcgg agacgcctcg ctggtcgtcg 16620 
ttggaccgcc gcagcaccct ggccacggct 16680 
tccaccacga gcagcaccgc atcggcggtg 16740 
gacgcctgca gtcccttcgc gtccggctcc 16800 
cgcccgttcc acaacgcgtc gtaggccacc 16860 
acgacggctt cccgacggcc cagcagccgg 16920 
cggccgacca cggccagcac cggctgcggc 16980 
acctcgtcga actcagccca ctcagcctcg 17040 
gactcgtcgg tcacgctcgt ttccctctcg 17100 
gatccagctc ggcgatcagc tcgaccagtt 17160 
gcccggcgcg gcctttgccc gccggcagca 17220 
tcggtagcag cctacggccc gggccatccg 17280 
ggatcagcgc ctccgacgtc cgcgccaacc 17340 
cgccatcacc ccgggtgccc tccgggaaca 174 00 
gcaccgccgc cagcagcggg gcgcggtccg 17460 
tgcggcgcag gaaccagccc agcgcaccgc 17520 
cgtggcgcgg gatcattccg aacagcagcg 17580 
ccagtaccac cgggcccgtg cgcgggatcc 17640 
gcagctgcac gatccggcgc gaaatccact 17700 
cctcgggcag ggtctcgtcg ctcatggcgt 17760 
gcactccacg tgcttgcgca actgcgtgag 17820 
gagctccacc gcgtcgtcgg ccttgcgcag 17880 
atcgcggcgc tgcacgtccg cgtgggtccg 17940 
gtcctgcgcc gtgcggcgct gcgcgcgggc 18000 
acccgcatcc ggggccacca cggtgccgat 18060 
cgacaccagc gcctcaccga tcagcttccg 18120 
ggcgaccgcc gacaccgccg acaccgcgcc 18180 
cacgtccgag ccgtccagga agatctcagg 18240 
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gccctccgga tcggtgccca ccgtcagccg cgctgcgcgc accacgcctg ccactgcggc 18300 
cggatccgac gggtcgacct cggcgcgcag caccgccagc gtcacggcgc ggtacatcgc 18360 
tccggtgtcg aggtaggtag cccccaatgc gaacgccaac ttgcgggaca ccgtcgactt 18420 
gccggtgccc gaagggccgt cgagcgccac cacaccaagg agcctggcgt gtgccacgtg 18480 
catgtccttc ccgagccgga accgatcggc "ccgacacccg gccggaccaa ccgtccattc 18540 
tgcctggtcc gctcctgccg atcccaatcg catccagatc caatccggac tgcgggacaa 18600 
atacgaaagt tctacgaata aggtcttggg tgtggatgta actgtgaccc ccttgccagg 18660 
gctcggcacg cagcaggact tcaccacccg atccggccac cgcatcggcg tgatcaccta 18720 
ccgggacggc cgcttcgagc tgatcgtctc cgaccacgaa gacccagaca aggtggccgc 18780 
ctcggtcgcc ttgaccacct ccgagaccag cacgctggcc aacctgctgg gcgccccgca 1884 0 
gctggtcgcc cagctcaacg agcagcacag cgaggtcgcc gggatcacca cctggcagct 18900 
gtccgtcgcc ccgggctccc cctacgccgg ccgaaccctc ggcgacacgg agatgcgcac 18960 
ccgcacctcc gtgtcgatcg tcgcggtcgt ccgggacggc accgtgcacc cctcaccgcg 19020 
gccggacttc cagttctccg ccggcgacct ggtggtggtc gtgggcaccg cgaaggggct 19080 
acgcgcagcc agcgaaatcc tggaaaaggg ctgatccggc ccgtcccgaa cactttcccg 19140 
gccacctgac cgggacacgc gtcaccaacg gcggtccagc gcgtgccgat cccctaacaa 19200 
ctccatacgc agggcacacg cgagaggaaa cacgttgcaa gacacggcga tctccctaat 19260 
agaactgggt gcggttttct tcggtctggg catcctcggg agactcgcat ggaaaatcgg 19320 
ggtgtcaccg atcccgctct acctgatcgg cgggctggcc ttcggcaccg gtggcctggt 19380 
ccccctgcac ggcatcgaac cgttcaccca cctcgcctcc gagatcggcg tggtcctgct 19440 
gttgttgctg ttaggcctgg aatactccgc gggtgaactc gtcaccggct tacgccgctc 19500 
ctggctggcc ggtgtcatag acatcgtgct gaacgcggta cccggtgcgc tggtggccct 19560 
gctgctgggt tggggtccgc tcggcgcgtt cacaatggcc ggcgtcacct acatctcttc 19620 
ctccggcatc atcgcgaagg tgctcggaga cctgggtcgg ctcggcaacc gggaaacgcc 19680 
ggtgatcctg tcgatcctgg tcttcgaaga cctggcgatg gccgtctacc tgccgatcct 19740 
gaccgccgtg ctggccgggg tcagcttcct cggcggactg accgcggtcg gggtgtcgct 19800 
ggtcgtgatc accctggtgc tggtggtcgc gctgcgcttc ggcaagtacg tgtccgcgct 198 60 
ggtcgacagc cccgacccgg aggtcttcct gctgcgcctg ctgggggcgg cgctgctcgt 1992 0 
cgcggggatc gcctcggagc tgcaggtctc ggcggcggtc ggctcgttcc tgctgggcat 19980 
cgcgatctcc gggtcgaccg cggcgaacgc gacccgcatg ctggagccgc tgcgcgacct 20040 
gttcgccgcg ctgttcttcg tcgtgttcgg cctcaacacc gatccgagcc agatcccgcc 20100 
ggtgctgccg gtggcgctgc tgctggcggt ggcgacctgc gcgacgaaag tgctcaccgg 20160 
ctggttcgcc gcccggatgc agggtgtcgg ccggatgggt cggctgcggg ccggtgccgc 20220 
gctggtggcg cgcggcgaat tctccatcgt gatcgccggg ctggccgtcg cctccggcgc 20280 
ggtgccgggc gagctcgccg ccctcgccac cgcctacgtg ctgctgatgg cgatcgtcgg 20340 
cccggtcgcg gcccgcgtcg tcgagccggt ggcccggatg ttcctcccca gccaggcgaa 20400 
gaaggcctga- ccggaaaagc ctccggctgt cccgagcgga cgggccggag gctttctcaa 20460 
cgctgcggat aacgcgggcg ggtctggcaa acgcaacacc ggtgcgcgaa cagcctcaga 20520 
ggcccactgc gcggtagagc gagccgatct cctggcggtt caacggccgg atogcgcccg 20580 
ggcgctcgtt ggtcagccgc acatcgccga tccgcgtccg caccaggcgc tgcaccgggt 20640 
agccgacgtg cttgagcagc cgccgcacga tgcggttgcg gccctcgtgc aacacgatct 20700 
ccaccagcga ccggttctgg ttcatgtcga ccagcttgaa ccggtccacc ttcaccagac 20760 
cgtcctccag ctccacgccc tcgcgcagct gctgcccgag gtccttcggg atcgacccca 20820 
gcacctccgc caggtacgtc ttgcgcacct tgtacgacgg gtgcatcagc cggttcgcca 20880 
gctcgccgtc gttggtgatc agcagcaggc cctcggtgtc ctggtccagc cgccccacgt 20940 
ggaacagctt gccctcccgt tcccgcaggt aatcaccgat gcacggacgc cccttgtcgt 21000 
ccgacatggt gcacaggatg ccgcgcggct tgttcagcag caggtgcgtc agatcgtcgt 21060 
tgaccatcac ccggttgccg tccacgtgga ccaccgcggt cgtcgggtcg acccggcgcc 21120 
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ccagctcggt gacgacctcg ccgtcgacct 
ccgcgcgccg cgaggccacc ccggcctggg 
aatcggcagt gctgctgtac ggggaacggt 
ggcaacaagg gggcgagcgg cggcaactcc 
aacagctcgg tcgtgcagta caggatgccg 
ataaggccgc gccccaccag ggtacggatg 
gcgacccgcg aacgcgtcac cggctgccgg 
gcacgcgtca gcttcgaacg ctgcccgtcc 
ttctcccgcg tgtagaaccg ccagccgtcg 
gcctcggcgt agccgtccga gagccgccgc 
tccagggtgt cggccagcag ctcctcgccg 
tccagcgccg catccagcgc cgaatcggag 
gcctcggaat ccccagcggc cgccgacggc 
ggggtcggcg actcctccgc gacctcgccc 
actcctcttc ttcatggctg gcgcgctgcg 
tgccgcccac ccagctcacg atcagctcac 
ccttctcccg gtagagctcg agcagcgcca 
cgcagtccga caccagctcc gagaacgtcg 
gcagcgccgc gtgctcgcgc accgaaacgc 
tcggcggtgg cttcggccgg aacaccgcgg 
ccccgatcac cacttccggc agcaggttcg 
gatagcgccg cagcgccccg gcctccaact 
cgcggtactg cagcagccgc gcgaagagca 
cctcgtcctc gacctccgcg gccggcagca 
cggcgaccac caggaactcg gtggtttcgt 
tgtaggcgat gaactcgtcg gtgaccttgt 
gctgcgagat cagctgcagc agcaggtcga 
tgaaccggcc gcctgcggcg ggatcctcgc 
tcactctccg ggctcctgca ggcgccgcac 
gtcggcgagc accacggcaa ccgcttcgcg 
gtggtcggcc cgcaatgcca gccgcgcctg 
gtacacggtg atcttcgaat cgtgcttgcg 
tgccgcctgc tccgggtccg gcggattcgc 
caattcggcc gctccgggca gggcgacccg 
agtgcgcggt acgcctgcgc cccggccgag 
gccaccgtgg tctccggaaa ccgcaccgtg 
ccgaacgcct cgacgacgcg cgccatcacc 
atggtcgcca ggatgccgct gatctccagc 
atggtgtcga tcagcagcgc caccccgcgc 
atcacggcgt ccgcggcggc caacgcgttc 
tccaccagca cgtagtcgta ctccgccagc 
tcccggccca cctccgcgac cagctgcacc 
agcaggtcca tgccctcgac ggtggtctgc 
tccatgatca cgttgtagat ggtctggtcg 
gagagcgcgc cctgcggatc gaaatccacc 
gctgcgccga ggttgatcgt cgacgtggtc 
gcgagcaccg atgcgggacc gtgccggtcc 
ggccggccag tgggaccgag gccgcggggg 



cgatccgccc ctcgacgatc atctcctcgg 21180 
acaacacttt ctgcagccgc acgccttccg 21240 
gctcagacgt catcaatcga atccacttcg 21300 
ttcaacgacg acagccccag ccgctccagg 213 60. 
cccgtctccg ggtcggttcc ggcctcttcg 21420 
acgccgtcga cgttcacacc ccgtactgcc 21480 
tacgcgatca ccgcgagggt ctccagcgcg 21540 
aacaggtacc gctccacgta cggggcgtac 21600 
ccgacccgcc gcaggtcaac gccccgttcc 21660 
agcgcggacc ggatccgggc caccggctgc 21720 
gccggcacgt cgaccacgag cagcaacgcc 21780 
gtcaggtccg gcgcactcgg cgcctccacc 21840 
tccaacggtt cctgcggctc agcgggcggc 21900 
gcggcctccg gctccagctc cgtcacccgt 21960 
cctcggcttc ggcctccgcc tgcgccacgc 22 020 
ccagcggggt ctcctgctcg aacagcagga 22 080 
ggaaccgcgc caccacctcg acggtgtgcc. 22140.. 
cgctgccctt ctccgccaac cgcacccgca 222 00 
cgtgctggtg gatgtggtcc agcgaaacgg 222 60 
ccgcgatctc ggcgaaccgc tgcggcggaa 22320 
cgaaccgctc ctcgaccgac accgaacgcg 22380 
cgccgaacag cgcggccacc tgcttgtacg 22440 
ggtcgcgggc ctccagcagc gccaggtccg 22500 
gccgggccgc cttcaggtct aacagcgtcg 22560 
tcaggtccca ccgctcgccg agcgccttgg 22620 
gcagcgctac ttcggtaacg tccagctggt 22680 
aggggccctc gaagttgtcc agccgcaccg 22740 
tcgactgcgc cgccggtacc gcggggtctg 22 800 
cagcaccgag tccgcgccgt tcttgtcgag 22860 
caccacgcgg cctcggtcca cggcgatgcc 22 920 
ctccaagccc agcaactcct cgtcggacac 22980 
ccgtccagag ccgccccggc ccgccgcggg 2304 0 
gtgatccgtc cgaggaacag tagtacggaa 23100 
acgcgtcatc gggcgatcac ctcgcgcgcc 23160 
cggggcgccc agcgggtgat cggctcgccg 23 22 0 
cggttgatca ccgcatcgaa gacgatgtcg 232 80 
tcgcgcgagt gcagcgtgcg cgggtcgaac 23340 
cfcgggttca gccgctcccg caccttctcg 23400 
aggctgaaga actcgcactc cagcgggatt 23460 
accgtcagca gccccagcga cggctggcag 23520 
gccggctgca gcacccggtg cagcgtctgc 235 80 
tcggccgcgg acaggtcgat gttgctgggc 23640 
cgcaccacgt catggatggc gaccggccgt 23700 
agctggtgcg gctggaccce gaggccgacc 23760 
agcagcaccc ggcgcccgta ttccgccagc 23 820 
ttcccgacgc cgcccttctg gttgcacatc 23880 
agcagcggcg gctcggggat gtgccgacgt 23 940 
ttcggcgcca ggccgacctc ctcctcgcgg 24000 
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cccgccgtct cccgggcgat gctcaggtcg accgcgcccc gggaccgccc ttcggcggcg 24060 
ggctgcggca gcgacatggc gatctgactc cttttggtgc cggtggcgat caaacgcagc 24120 
ctaagtggca ttcgttacca gcggcaacgc gcctcgccgt ggtgtcggaa gttccctgtg 24180 
atctgcgtgt gatcaacgtc taccgcgtac accaccccct cgcattcgca gcgtcaccaa 2424 0 
gtcactccgt agcgtgtgcg* cgcggatgcg tcgtcgcgta gacctcccgc agcgtgttca 24300 
ccgtgatcaa ggtgtagacc tgggtcgtcg tgaccgaggc gtggcccagc agttcctgca 24360 
ccacgcggac atccgcgccg ccctccacaa ggtgcgtggc gaacgaatgc cgcagcacgt 24420 
gcggcgaaac cgagccgttg atcccggcgc gctgcgcggc ggtcttcaac gcgttccacg 244 80 
agctctgccg ggaaagccgg gtaccgcgcg aattcaggaa caccgcggca ctaccccgcc 2454 0 
cgcgcgcggc caggccgggc ctggcccgta ccaggtacgc gtccagcgcc gccagcgccg 24600 
gtcggccgat gggcaccagc cgctggcggc cgcccttccc ttccagcagc acggtccggt 24660 
tggtgccgtc gatgtcgtcg aggtccaacc cgaccgcttc ggagatccgc gctcccgtcg 24720 
aatacagcac ttccagcagc gcccggtccc gcaggccccg ggcatcctcg cccccggcgt 24780 
gatcgagcag cttcccgacg tcgccgatgg aaagcgcctt cggcagccgc ttcggcgggc 24840 
tcggcggcgc gacatcgcgc gctacgtcga ccggcagcag cccctcggcg tgcgcgaagc 24900 
ggtgcaggcc gcgggcggcc accaaagctc gggccgacga cgacggcgcc agcgcgggcc 24960 
gctgcccggt gccttcccgg agctccgcca agaacccgga caggtgctcc gaacgcacct 25020 
cgccgagccc agaaatccct gccgcgacca ggaattcggc ataccggcgc aggtccctgg 25080 
cgtaggagtc gagagtgctg cgggcggtac cgcgctcgac ggcaaggtgg tcgaggtacc 2514 0 
cggtgatcgc cccgcgcagg tccggcggca gctcctcgaa acccctcacc gggcggcgaa 25200 
gcgctggggc cggtccggcc attcggcatc cgccgggcga ggctccaccc ggccctcgcg 25260 
gaccgcgtgg gcggcgagca gccccgccac cccggggccg ttgacgatct ccccggccag 25320 
cgccatccgc accgcctcgg cgagcgggaa gcggcggatc 25360 



<210> 4 
<211> 45624 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 4 

tcagccgacc ggcttccgcg ccgtcagaat 
ataatcgcag accgcggcgg cccagccggc 
gtgctggtgc gcagcgagct cttcggcgaa 
cacgtcctcc caatcgagga tctcgaaccc 
agccaggcag atccgaaggc cggtcggcca 
cgcttctcgt ttgacgacct cggtgacgcc 
gatttcccgg atggcacggt ccggttcgga 
ggcgtcgaaa gcattgtccg ggtacggcag 
ccggtggctt agtccgcgtt cgcgtgcgca 
gacggtgatg ccggtgatct ggatcgcgtt 
ggtaccgcac cccacatcga gcagtcgaac 
aaggtcggtg agccggtcgg cggcctgctg 
gccgtggtgg atggcgcagg ggccgcccgc 
atacatctgc ccaacctgct gcgatgttgg 
tctctgcaac gttcaccttc tcgtaatttt 
aaagacggca aactagccgt cagttaattc 
tataactatc cggccaggag gcaaacagcc 



cgcatagccc atgtcgtggg catatttctc 60 
gacagccggc ccgtacctgt ccgcgatccc 120 
ctgcggcatg aagtaccggg tcctcgacga 180 
cgctgcacgc agcgattcca gaagttgctc 240 
cctgtccccg gacaccggca tcccgccgcc 300 
gaggatgcca ccgggtttga gtactcgaag 360 
catctccaac agcgactgca tggcccaggc 420 
ggacatggca tcgacgcacg agaagtccac 4 80 
atcagcggcg atggccactt gcacctggct 54 0 
gtcgcgcgcg acgcgcagcg ctggttgtcc 600 
gccgccatcg agcacggtcc gttcggcgac 660 
ccaggaagcc cgcccgtcgt tctcccagta 720 
gaccgagttc agcaacggcg tgaccaggtc 780 
tgcgccacct ggcaacaccg gtatgcctga 840 
ctggaaccga gcaggaaaca cacagagaga 900 
gcggttaccg ccgcatgcgg cggtaccgac 960 
agcgcaaatc ttccccatga atccccgcag 1020 
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aaccaacggt gatcaccgta caggtccggc ccaccccgat cgagcggagc tcgcgcattt 1080 
cactcgttcg cgtgaaacca ggtgcggcga cccaaagatc ttcgatggtc gcgtgacatt 1140 
cctgccgtga cacactattg ttgccgccgg aagtccgttg gacgtcaacg gcgatcgaca 1200 
tacgcagcgc attttccaaa tcctggccaa tgggagtagg ggctgctgcg ctaatcgggg 1260 
gaaagagact ggggtccgaa gtatgcgcgt actcgtcgtt cccttgccct atccgacgca 1320 
tctcatggca atggtgccgc tgtgctgggc gctgcaagca tccgggcacg aggtcctgat 1380 
cgccgcacca ccagagctgc aggcgaccgc gcatggtgca ggtctcacca cggccgggat 144 0 
ccgcgggaac gacaggaccg gcgatacggg tggaaccacg cagctgcgct ttcccaatcc 1500 
ggcgttcggt cagcgcgaca ccgaggcagg ccggcaactg tgggagcaga ccgcgtccaa 1560 
tgtcgcgcaa agctcgctcg atcagctccc cgaatacctt cgactggccg aggcctggcg 1620 
accgtcagtg ctgttggtcg acgtctgcgc gctgatcggc cgggtgctcg gcggattgct 1680 
cgacctgccg gtcgtgctgc accgctgggg agtcgacccc accgcaggcc ccttcagcga 174 0 
tcgagcccac gagttgcttg acccggtgtg ccggcaccac ggactgaccg gcctgcccac 1800 
tcccgagctc atcctcgatc cctgtccgcc gagcctgcaa gcaagcgacg cgccgcaagg 1860 
cgcaccggtc cagtacgtgc cgtacaacgg aagcggcgca ttcccggcat ggggcgcggc 1920 
gcgcacctca gcacggcggg tctgcatctg catgggccgc atggtgctga acgccaccgg 1980 
gccggctccg ctgctgcgcg cagtagcggc tgccaccgag ttgcccggcg tcgaggccgt 204 0 
gatcgccgtt ccccccgagc accgggcact tctcaccgac ctacccgaca acgcccggat 2100 
cgccgaatcg gtcccgctca acctgttcct gcgtacctgc gagctggtca tctgcgcggg 2160 
cggctcggga acggcattca ccgcgacccg actcggcatc ccgcaactcg tgcttcccca 2220 
gtacttcgac cagttcgact acgcgcgcaa cctcgccgct gccggggcgg gcatctgctt 22 80 
gccggatgag caggcccagt ccgaccacga acagttcacc gactcaatcg caacggtgct 2340 
cggcgacacc ggcttcgctg ctgcggcaat caaactcagc gacgagatca cggccatgcc 2400 
ccatcccgcc gcgctggtgc ggacgctgga gaacactgcg gccatccgtg cctgatgaac 24 60 
tagtcacccg agaacagctg gatccggaga accgatgccc tcccagaacg cgctgtacct 2520 
ggacctgctc aagaaggtac tcaccaacac gatttacagt gatcggccgc atccgaacgc 2580 
ctggcaggac aacaccgact acaggcaggc cgctcgggcc aaaggcacgg actggccaac 2640 
.tgtcgcgcac acgatgatcg gtctggagcg gctggacaac ctccagcact gcgtggaagc 2700 
cgtgctcgca gacggtgttc ccggggattt cgccgagacc ggtgtctggc ggggcggcgc 2760 
atgcatcttc atgcgcgcgg ttctccaggc attcggagat accggacgta ccgtctgggt 282 0 
agtggattcc ttccagggaa tgccggaaag ctctgcgcaa gaccaccaag cggaccaggc 2880 
tatggcgctg cacgagtaca acgacgtgct tggcgtatcg cttgagaccg tccggcagaa 2940 
cttcgcccgc tacgggctgc tcgacgaaca ggtcaggttc ctccccggct ggttccggga 3000 
caccttgccc accgccccca tccaggaact cgccgtgcta cgactcgacg gcgacctcta 3060 
cgaatccaca atggactcat tgcggaacct gtacccgaag ctctcgccgg gcggattcgt 3120 
catcatcgac gactattttc tgccgtcctg ccaggacgcg gtgaaggggt tccgcgcgga 3180 
actcgggatc acggaaccca tccacgacat cgacggcacg ggcgcctact ggcgccgcag 324 0 
ctggtgaacg gctcagccgt cctcgacgcc gagcgcttgc cggggcacga accccggcgc 3300 
ggcaggctca gcgttgagcc ctttctccac gaataccagg ttgtggtaga agtgcagggc 3360 
cgccacgttc cgttccgtgt aggagggctc ggtcccgcac cgcgattcgc gctcctgata 3420 
atgcaggccg tcgatcagtt ctttgagcat gtcgatcgag gtgcgctggg ccgcgggttc 34 80 
cccatcgcga ccgccgtagc cgggccagta cgccgtctgg agatcctcga tgacgtacaa 3540 
accacctggg cggacgtgcg gaaacaggga ttggaaggat ttcttgacgt ggtcgttgac 3600 
atggctgccg tcgtcgatga caatgtcgaa cggcccgatc ttccccgcca tgtcttccag 3660 
gaattccgca tcgctctggt cacctcgcag ctttcgcact cggtgccctt cgttcccggc 3720 
tttctcgaaa atgtccagcc cgtaaacgag acctcgccgg aagtaccgct gccacatgcg 3780 
cagcgaagca ccaccgagtt cgggtgcgtg ataaccaccg attcctattt ccaacactcg 3840 
caccggggca tcctggaatc gggagaagtg gtgctcgtag tgttcggtgt accagtgcag 3900 
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gtccgcccat ttgtcggatc cgtagcggac 
ggcggctgcg gccagcacgg cgtgcaccgc 
gcgctgcgcg ccggcgaggc cctggtcggg 
gacttcacgg ctgttgtgcc gcctgccgtc 
caccgcgtcg aaccggagaa cagcagctgc 
gtgatcgagc tcaacggttc tccggaccag 
tcgcaccgac tcgtccaacg acagcggatc 
gtcggccacc gcgtccagtc cggccacctg 
acccgccgcg agcagcaaac gctccaccac 
ggaaagtcat caccttttcg gttccagcgc 
ccacaaagta gagccggcgg aggcgaccgg 
atcacccgga cgcgtagatg atctcggctg 
gaggggcggt gcccgaattc catgacccgg 
agatcaccgt cgagcccgac gatcctcgtt 
gtttcaccgg aaaacccgaa cgcatccaca 
ccgtcgccga cgccgtgcgc accggcaggc 
tcgagaatct cgttgcggac ccggcgatcc 
gcgtgtacta cgacagcacg cgcggggcat 
aggtgtaccg aaccctgttc aagaactggg 
gggtgggcgc aggcgggcac atcctcggcg 
gttcggtcgt cgactacctt caaggcgtcg 
tgcacatcgt cgaggccgac cggaactcca 
acaccggtgg cggtggcggc aacttcggga 
acgtggtcag caccgacgcc gcagagctcc 
gatcgttcca ctggccgtgg cacgaactga 
acttcggcaa ttggtacgag cagcacagcg 
gcacgctcgt ctgcgcacac cggcaagctg 
gcacggatcc gaacgcggaa cgcaccctgg 
tcggcgtgac tccagccgaa gggctgcggg 
tggccggggc gatcgccgaa ggcggcgaac 
cctacttgcg caccgggctg tccgaagctc 
tc-tacggata cgacaaccct gcggcggcgc 
atgccgtggc tccgtcggcc accgcactcg 
tcgtcacgaa ctggtcggag cccgccgagg 
tctaccgcga gatgtacgcc gaaaccggcg 
gctcctacat caactacccg gacaccgact 
ttgcctggca cgacctgtac • tacaaagaca 
ggtgggatcc gcagaacatc ttccagcacg 
cacccggtca gccatgagga gctcatcacg 
gaacgcatca tcctcgccgc cggatccagt 
ctcggactcg ccaggatcgc acccgtgctg 
gcccccgaca tcgaacggac cgaggtcgcg 
gacttcgtcc tgacgctaca gtccggtgag 
gacgtcccgc tgcggatcgg ttacgagctc 
ggagctccca gggccgtcgg cgcccggagc 
tcgatacccg gtccgtcgga actgtccgat 
ggctgcgggc accgacgtcc cgacctcaac 
tggggcggcc tgcactggtt caccccgcta 



tgccagcgca ccgaggtccg cgcgcctggt 3960 
ggtggacacc ctgttccgga acgccagcag 4020 
cgagaactgc gtcatgctgt ccgaccagcg 4080 
aaccgggccg aagagccctt ccagcagatc 4140 
ttcggcgacc gccgcgagcc *g£aggcctgc 4200 
ctgagcgcca gaagtgatct ccaggccgat 4260 
gcagcgtacg acgagttcgt cgacgatggc 4320 
cactgcttcc tggagcttcg ccgggcccgc 4380 
cgaccagggg gcaactgcga tctcactcat 4440 
atacggcacc atgcgcaccg cgatatcgat 4500 
gtttcacgta tgagctgaaa cacacagata 4560 
cgggcgaaca aagtggacca gtcagaaaag 462 0 
cacccatgaa tcgtcgaacc ccaggaacag 4680 
atccggacct cgtcgtcggg cacaaccccc 474 0 
tcgccagctc cgccgaagac gtcgtgcacg 4800 
gggtaggggt ccgcagcgg'c gggcactgct 4 860 
gagtgctcgt cgacctctcc gagctcaacc 4920 
tcgcgatcga ggcgggcgcc gccctcgggc 498 0 
gcgtgacgat cccgaccggc gcatgtcccg 5040 
ggggatacgg cccgctgtcg cgccgattcg 5100 
aggtcgtcgt ggtcgaccag gccggtgaag 5160 
cgggcgccgg tcacgacttg tggtgggcgc 5220 
tcgtcaccag gttttggctc cgaacgccgg 52 80 
tgccacggcc gcccgcgaca gtgctgctcc 5340 
cagagcagtc attcgccgtg ctcctacaga 54 00 
cgcctgaatc cacgcaactc ' gggttgttca 5460 
gctacgtcac gctgaacgtt cacctggacg 5520 
ccgaacacct gtcggcgatc aacgcccagg 5580 
aaaccctgcc gtggttgcga tcgacccagg 564 0 
cgggcatgca acggaccaag gtcaaagccg 57 00 
aactagccac ggtttaccgg cggctgaccg 5760 
tgttgctgct cggttacggc ggtatggcga 5 820 
ctcagcgcga ctcggttctc aaagcgctgt 5B80 
acgagcggca tctgacctgg attcgcggtt 594 0 
gagttccggt gccaggtacc cgtgtcgacg 6000 
tggccgatcc attgtggaac acctccggtg 6060 
actacccgcg gctgcagcgg gccaaagcgc 6120 
gcctgtcgat caaaccgccg gcacggcttt 6180 
atgtccacaa cgcacgagat cgaaaccgtg 624 0 
gcggcgagcc tggccgacct gaccaccgaa 6300 
atcgacgaga tcctcttccg cgcggaaccg 6360 
gtccagatca cccaccgagg cgagaccgtt 642 0 
ctgatcaagg ccgagcaacg accggtcgga 6480 
accgatctca tcgccgagtt gttcggccca 6540 
accaacttcc tccgaaccac cacatccggt 6600 
ggcttccagg ccatctccgc agtggtcgcc -6660 
ttgctcgcct cccactaccg cacggacaag 6720 
tacgagcgac acctcggcga gttccgtgat 6780 
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cgcccggtgc gcatcctgga gatcggtgtc 
gaatccctga agatgtggaa gcgctacttc 
ttcgacaagt ccttcctcga ccagcagagg 
cccgaggagc tggccgccgt tgacgacaag 
ggcagccaca tcaacggaca cgtgcgcaca 
agcggtggcg tatacgtgat cgaggatctg 
caggcgcagt gcccggccgc acccggcacc 
ggcgttcagc acgaggagca gccgcatgcg 
aatttggtcg gcctccacac ctaccacaac 
gaaggcggcg ttcctgcttg ggtgccaagg 
gtgaacagcg cggaggacga gtgaacagca 
cggatcaggt ggagtccatc ttcgatgcgt 
actgggcggg cgggtatcgg gaggatgccg 
aactgaccga cctgttcatc gacaaggccg 
tgggctgcgg caatgggcag cccgtagtcc 
ccggaatcac cgtgaacgcc cagcatctcg 
gactggccgg cagtcttgag ttcgatctag 
gtttctttca ggccgcatgg gcgatgcagt 
cgatccgcga ggtccaccga atcctggaac 
tcactcgggt tcgactcccg gaagagtacg 
ccttgaacag cttcacggcg ctggtcagcg 
acctcacggc acagaccagg tgcatggtct 
tcgatgagct cgccggcgtc gagcctgcgg 
gagacatcgc ggcgaagcac ggaccgggac 
accggaaaca tccggattac gccagaaacg 
ctcgaaagaa gcagtcctga tggcctccga 
ggcacccgcg gatgatccct tctaccgacc 
cacgctcctc agggcccggc ccgtctcggc 
caaggcctgg caaatcctct accggtccaa 
cggcaccgtt ctggtgccga acatcccgtg 
cgcagtgggc acccacggcc tcggtagcca 
aaccgagccg gagaccgagc tgatcgccgt 
caccgactac gaaggcctcg gtactcctgg 
gggacacgcc atgctcgatg ccgcccgcgc 
gaccgactgc ccggtcggca tctggggcta 
cggcgaactg caccccacct acgcacctga 
ggtgccgatc gatctgctgg acatcateca 
gctggccggc ctggtcgggc atgccgctgc 
caccgaagcg ggtcgtaccg ccgttgatca 
cacccgcttc ctcggccgcg agctgagcga 
acctcgatgg cgcgcacgac tggccgaaag 
ccccacgctc gtctaccaca gcacggacga 
actccgggac agctaccgcg cggcgggtac 
ggctcacttt cccgccgccc tggccagctc 
cttctccgag ccgtccgcga tcagcggtcc 
gcctctccag cggcttcccc gctggcttcc 
ctacccgcgg atgccgggcg gctatgtgga 
caccagcgcg gcctgctgga tcatcgattc 



ggtggctaca 


acttcgacgg 


tggcggcggc 


6B40 


caccgcggcc 


tcgtgttcgg 


gatggacgtt 


6900 


ctctgcaccg 


tccgcgccga 


ccagagcaag 


6960 


tacggaccgt 


tcgacatcat 


catcgacgat 


7020 


tccctggaaa 


cgctgttccc 


ccggttgcgc 


7080 


tggacgacct 


atgctcccgg 


attcggcggg 


7140 


acggtcagcc 


tgctcaagaa 


cctgttggaa 


7200 


ggctcgtacg 


agccgagcta 


cctggaacgc 


7260 


atcgcgttcc 


tggagaaagg 


cgtcaacgcc 


7320 


agtctggacg 


acatattgca 


cctggccgac 


7380 


gaggggcgaa 


cacacaggca 


tttccgaccg 


7440 


tggcgcacgg 


gcgtcccctg 


caccacggtt 


7500 


gtgccacacc 


gtggtcggat 


gctgccgacc 


7560 


cgctccgtcc 


cggagcgcac 


ctgttcgacc 


7620 


gtgcggcatg 


cgccagcggc 


gttcgagtca 


7680 


ccgccgccac 


caggctcgcc 


aacgagaccg 


7740 


tcgacggcgc 


ccagctgccc 


tacccggacg 


7800 


ccgtcgtgca 


gatcgtggac 


caggccgccg 


7860 


ccggcggccg 


gttcgtcctc 


ggagacatca 


7920 


cggcggtttg 


gacgggcacg 


accgcccata 


7980 


aagccgggtt 


cgagattctc 


gaagtcaccg 


8040 


cctggtacgt 


cgacgagttg 


ctccggaaac 


8100 


ctgtcggcac 


ctaccagcaa 


cgctacttgg 


8160 


cagcacagct 


gatcgccgcg 


gttgcggaat 


8220 


aggaaagcat 


gggtttcatg 


ctcctgcagg 


8280 


gcacgccagc 


ctggtcggcg 


acgatctgcg 


8340 


gccgacgccg 


ctaccgccgg 


gtgtcccggg 


8400 


actgcgcggc 


acgggcgaac 


ccgtcgcagc 


8460 


ctccgccctt 


ggcatgccga 


acgccgtctc 


8520 


gccgcgcgaa 


gatcgcccca 


tcatcacttt 


8580 


agttgccccg 


tcgtacctgc 


ttcgaaccgg 


8640 


ggccctcgac 


cgcgggtggg 


ccgtggtcat 


8700 


aacccacacc 


tacaccgtcg 


gcagggcgca 


8760 


tgcgcaacgg 


ctaccgggct 


ccggcctgac 


8820 


tgcgcagggt 


gggcaagcgt 


cggccttcgc 


8880 


actgcgaatc 


cgcgctgcgg 


ccgcaggtgc 


8940 


ccgaaatgac 


ggggtgttca 


ccgggccggt 


9000 


ctaccccgat 


ctgccattcg 


acgagcttct 


9060 


agtgcgcgag 


ctcggtgcac 


cggagctcgt 


9120 


cttcctcgac 


acttccggcc 


ttttcgagca 


9180 


cgtcgcaggt 


aggaacggtg 


gcccggtggt 


9240 


cgagatcgtt 


ccgttcgcat 


tcggcgagcg 


9300 


gccagtgcgg 


tggcatccgc 


tctccggatt 


9360 


gcgagtggtc 


gtctcgtggt 


tcgacgagca 


9420 


gcgagatgcc 


aggtgagcgg 


atggcggtga 


9480 


ggatgcccgg 


tcaggcttcg 


aggcgaacta 


9540 


cccgcaccga 


cgggcctgac 


cgatcgcctc 


9600 


gcccgaatct 


ccggccaccg 


cagggtcgtc 


9660 
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cacgcctgtc cctgctctga tgtcgcgtgc gaaggcggtg accgccttgc gaacctgatc 9720 
ttccgctggc aaggacaact cgtcgacaac gcccttccgc tcgattcgga tcacggcctg 9780 
ccactcggcg ggcggcgtga acgcccggtc gacgacgatt cgcccacgac tcccccacag 9840 
ctcgtacgcg ctgcggtagt ggtgcacgaa acegtatccg aggtgggcaa cggtgccacc 9900 
ttccgattgg agcagcacgc tgcccgacaa gtcgacgccc gactcctggg cctcgtgcga 9960 
gcttgcgccg agaaccgtga gcggaccgag gagaaagtgc cgagcggcac gggcgggata 10020 
gacaccgatg tccagcaacg ctccgccacc gagttcggtg cgatagcgga tgtccgtgtc 1008 0 
gggaagcggc ggaattccga acacggcggt gaactcccgg agctcaccga tctccccgga 10140 
ttgcagcagg tcgcggacca cgtcgtgccg gccgtggtgg aggaacaggt aattctcccg 102,00 
cagcagcagg ttcttcctcc tggccagccc gaccaggcga gcggtgtcgg acgccgtcgt 10260 
cgtcagcggt ttctccgcaa gcacgtgttt gtctgcctca agcgccttgc cgatccactc 10320 
tgcatgcatg ccaggcggca acggcacgta gacggcatcg atgtccggcc gctccaggag 103 80 
ccgctggtaa cccagcaccg cctcgcattc gaatcgcgct gcgaaccgtt cggctttcgc 10440 
cggatcacgg ctcgccaccg ccaccacctc tgtttcggcc acgtcgcaca tcgcgggcag 10500 
catccgtcgc cacgcgaagg aagcgcaccc gagcacaccg atgcgcaccg gctttcgcat 10560 
cgagctggtc atcgccccaa cgcccacaag ctatgcaggg aggcgaccaa gctgcgcgcc 10620 
tggatgttca agaagtgggt gctccggagc agctcgccca actgtcccaa agtcatccac 10680 
cggaagtcgc tgggaggtcg tgccgcgaag tcctcatgca cctcgatgat ccggtacctg 10740 
ttctgcgcct ggtagaaccg accgccttct tcggacagga tcgattcgta ccgcacggtt 10800 
tcgggatcgg cggtgagcac gtcgtccaag aacggcggcc agtcgttcga cggggtgctt 10860 
tggtagttgg ccacactgca ctggaccgtg ggagcgattt ccgcagtcca cttataccca 10920 
gcctccaccc gagcgtggac caaaccgtgc agcactccgc cgatccgctt gaccaacagt 10980 
gcgatctcac cttgttctcg cggttcgatc atcggctgcg tccagctggc gacctcacga 1104 0 
ttggtcgcgg acaccgacac cgcgatcacc gcgaagtact tgccgtcctg gtgggcgatc 11100 
tcggtatcgg tgcgatacca cttgtcgacc ctgctcagcg gaacgcgcgt tgcccgcaag 11160 
cggtagcggg ccttggcttc ctcgaaccaa ccgaccgcct cggagatact cgacgaatcg 11220 
atgccgtgcg agagcgacct ggccaccgcc tgtcggaagg gctcctccga ggcggctagt 112 80 
tcgggtccgg tggcggaatc gtggaacggg atgcaagaca gcaccgtccg ggtgtccatg 11340 
ttgacgatgt tgtcctgacg aaggagagcc agcacctggc cgagcgtcaa ccagcagaag 11400 
tctggcagga ctggcacttc ctcctggact tccaccacca tgttccggtt gcgcttccgg 114 60 
tagaaccagg acccctgttc agactggagc acgtctacca gcacgcggcc gcggccccgc 11520 
ccgaggaagt agtccacata gggcggaacg ctgccacggt gtgcctgcgt gtagttgctc 11580 
cgagttgcct gaaccgtcgg cgagagctgc aggacgttga cgttgcccgg ttccatcttt 11640 
gctgacatga ggcagtgcag cacgccgtcg atctccttga cgagaatgcc gaggatacct 11700 
acttcagcct ggttgatgat cggctggtgc cagctgggtg ccgcgccata gttggtctcg 11760 
acctgcaggc cttctaccgt gaagaaccta ccgtcggcat gaaccaggtt ctccgtgctg 11820 
gcatcgaatt tccatttcga caggcggtcg aacgggatgc gattggtctc gaagctgttc 11880 
tcgcccagcc gatcggcgag ccagcagtgg aactccgtgg tcggaatcat gccattgcaa 11940 
gcgctcagcg cagagtcgac gaaccgccgc gtgttgttgc tgccgagcgg cgcagcagca 12000 
cttgcctcag cttcgacaga actgctcatg cccaatccct cactgccgtg acgatgtgca 12060 
cgccggctcc caggggcgtc gcggtctggt ccagcggcct ggcgaggatc atcgaagcga 12120 
tccgcccgta aggaccgcga gcacctcgtc ctggcgttcg gtcatgaaga aatgcccgcc 12180 
ggagaagaca cggacgtcgg ccacggcctc ggtgtgctct cgccaggctt ccgcctcatc 12240 
caaggtgacc ttcggctcgg catctaccac cagcacggtg atcgggcaac ccaccttcgc 12300 
tcctggcgca cagcggtatg tctcgaccgc acggtagtcg ttgctcagcg ggcgccacga 12360 
tcctcgtcgg aaatcggttg gagcgcaaac ctctggggag atcaacaccc cattggacga 12420 
ttctggacaa gaatctgact tcgggactcg cctggcatcg tcccggtcac ggcggaatcc 124 80 
gagtcgacgc aacagcgatg aacacctgcc aaaccccggc gtccatgttt ccggccacac 12540 
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gttgctcacg gatggccatc agactgccca 
ttttccagac gtggaacgat gtcgtgcggg 
agcgccgcgt tggcagcgaa cgacggctcg 
tcgtcttcgg taaacgtcgc ggggtcgagg 
tccgccctac gagattcgtc ccagaaggtc 
accgtggcgg tttcctgcgt cgtcgtcgag 
agcagttcgt tgagcggtac gtattcgtgg 
tcccgtactt cgtcgtcaga caaggtggcg 
cgcaacaacc tttccaccat tgcttgttcc 
accctgctca gccgccgctt ggtcagcccg 
cgttcgcgca accactccgg caacaccgcc 
atgtagtcca agtccacagg caggcgcatc 
cccgtcacga tctcttcatc gaacgtggct 
gtcccgagcg ggtcgacgcg ctgctcaggc 
gaccgcagcc accccgacac atcgagggcg 
gctcgcgcca caactggccc cgaacacacc 
catttctcgg cgaaccccat gagatcgtcc 
tccacggcgt ccgtgtctct gccggagttg 
tgccgcccct tctcgtcgaa cgagaccccg 
aaggagatga ggtcgagctc gtcgccgacg 
atcgacacca tattcgggca gatggcgaca 
cacgccaacg gcaccaggtt gaagaagtgc 
cgcatgaact ctccgatcgc aattgaacac 
tctgatatcc cccgggaaac gccgctttcg 
gtagcgatct tgacaagcgc gagcaggtcg 
gtgtgccggg tagatcgagg atttctgagt 
tgatgtcccg caccgcatcg aaccagggca 
gtcccgctgg gacgtcataa cgcgattcgc 
gcgcgaagac tgatcgtcgc ctgcatgcag 
cgcgccagag tgacttcgtg tgacgacact 
gccaaggacc ggatattggc atccgtacgc 
ttcgtggctg gatcgacgcc gatccggcca 
gtggcactgg tggaagccgc gctggagctc 
ttcgagagcg agttcgcccg cttcttcggc 
tcgtcggcca atctcctggc actgagttcg 
ctgcggcccg gcgacgaagt gatcactgcg 
gcggtccaaa acggactcgt cccggtattc 
acgccagacc gcatcaaggc cgccgtcacg 
accctgggca accccttcgc cgctgacgaa 
ttcctcgtcg aagacaactg tgatgcggtg 
accttcggcg acctgacaac ggtcagcttc 
ggtggctgcg tgttgaccgg cagcctggaa 
tggggacggg attgctggtg cgagcccggc 
taccacctcg gtacccttcc accgggctac 
tacaacctca agaccaccga cctgcaggcc 
tccgcattcg ggtcggcacg ccgccgtaac 
ttgccgggcc tgctgctgcc ggtagccaca 
gcgatcacca tcagtgcgga cgccgggttc 



gtgcggcggt tctcacgttc ggcaactagc 12600 
ctgggacttt cctcgatttc acggcggatc 12660 
tcgagcaggc gggccagctg acctcgcacg 12720 
accagaccgg ctccccgatc ggctaggagc 12780 
ccagggagaa tcaactgcgg tacgccgttg. 12840 
ccatgatgga tgatcactga acacgactcc 12900 
acccggacgt tcgagggcaa ctcccccatc 12960 
atcacctcga cgtcgagcct ggccgcgccg 13 020 
tggtcacttt gctccccgta ctgttcggtc 13080 
cgcgtgatgc agacgcgcgg cttcgtcggt 1314 0 
ggaccgttgt acggcacgaa acgcatcgag 13200 
caggatggaa tcggatctat ggtcgcttgg 13260 
ccgaacttgg cgagcttcgc tcccagccac 1332 0 
ggcttcgatt cctggtattc gaggaaaccg 13380 
acgagcatcc gcacgtgtcg tgcgccgagc 13440 
atagcgtccc acacaacgag atcaggctgc 13500 
agcgatcggt catccacaag gtggagttgc 13 560 
attgacagca gctcgtcgaa gagttccgga 13 620 
ctgccgagaa cgagttcgtt cttggccgcc 13 680 
ggaaccgcgg tgagtcctgc tccggtgacc 13 740 
cggacctcgt gccccgcggc acgcaacgcc 13 800 
gaactcgccg gcagcggggt gaacaggaca 13 860 
ccgggaagca tgccaagaat cacagaaatc 13 920 
caagccaaat cttaggcctt ccaggtgatg 13 980 
ttcccgctag cctgggctct accgagtcgg 14040 
caatgagcgc ttctccttgc tccgctgtcc 14100 
ggaaggtgta aggcgccgag acagcacact 14160 
cacgggcatc gctcatctcc tgaaggcaag 14220 
agccggaaaa ccagagcgct ggggaaaggg 142 80 
tgcgctaccg ctactgagat ggtgccggat 14340 
gattaccacc gcgaacagga atccccgacc 14400 
tcgggcgccg tgctcgacga ggacgaccgg 14460 
cggatcgccg cgggcgggaa tgcacggcga 14520 
ctccgcaagg ctcatctcgt caactccggt 14580 
cttacctccc ccaaactcgg cgaggcacga 14640 
gcggtcggct tccccacgac gatcaatccg 14700 
gtcgacgtgg aactgggcac ctacaacgca 14 760 
gaacggacgc gagccatcat gctggcgcac 14 820 
atcgcggaga tcgcaaaaga acacgagctg 14 880 
ggatccacct accggggacg gctgaccgga 14 94 0 
tatcctgccc atcacatcac cagcggcgag 15000 
ttggctcgca tcatcgagtc gctgcgtgac 15060 
gtggacaaca cctgccgcaa gaggttcgac 15120 
gaccacaagt acacgttctc ccacgtcggt 15180 
gcacttgcgc tgagccagtt gagcaagatt 1524 0 
tggcgacggt tgcgcgaagg gctgtccggg 15300 
ccgcacagcg acccgagctg gttcgggttt 15360 
acccgtgccg ccctggtgaa cttcctggaa 15420 
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tcccgcaaca tcggcacccg actgctgttc ggcggtaaca tcacccggca cccggccttc 154 80 
gagcaggtgc ggtaccggat cgccgacgcg ctcaccaaca gcgacatcgt caccgaccga 15540 
accttctggg tcggcgtcta cccaggcata acggaccaaa tgatcgacta cgtcgtcgaa 15600 
tcaatcgctg aattcgtggc caagagttcc tagcatccag catggcggca tctcggagga 15660 
tttcagcaac gtgatcaacc tgcaccagcc gatcctcggc accgaagaac tcgacgcgat 15720 
cgcggaggtg ttcgcctcca actggatcgg gctcgggccg cgcacccgga cgttcgaggc 15780 
cgaattcgcc caccacctgg gagtggatcc cgaacaggtc gtgttcctca actcggggac 15840 
tgccgcgctg ttccttaccg tgcaggtgct cgacctcggc ccaggcgacg acgtggtact 15900 
tccttcgata agcttcgtgg cggcggccaa cgccatcgca tcctccggtg cccgcccggt 15960 
gttctgcgac gtcgaccccc ggacgttgaa ccccacgctg gatgatgtgg cgagggccat 16020 
cacgccggcg accaaggccg tat.tgctgct ccactatgga ggatcgccgg gagaagtcac 16080 
cgcgatcgcc gatttctgcc gtgaaaaggg cctcatgctc atcgaggact ccgcctgcgc 16140 
ggtggcatcg tccgtgcacg gcaccgcttg cggaaccttt ggtgacctgg ccacgtggag 16200 
tttcgatgcg atgaagatcc tggtcaccgg ggatgggggc atgttctacg cggcggatcc 16260 
ggagctggcg caccgcgcaa gacgactcgc ctaccacggt cttgagcaga tgagcggatt 16320 
cgattcggcc aagtcttcca accgctggtg ggatattcgc gtcgaagaca tcggccagcg 16380 
gctgatcggg aacgacatga cggcagcgct tggcagcgtg cagctgcgca aactgccaga 16440 
attcatcaac aggcgtagag aaatcgctac gcagtacgac cggttgcttt ccgatgtgcc 16500 
gggtgtcctc ctaccgccga cgctaccgga tgggcacgtc tcgtcacact acttctactg 16560 
ggtccagctg gctccggaga tccgcgacca ggtggcgcag caaatgctgg aacgcggcat 16620 
ctacacgagc taccgctacc cgcccctgca caaggtcccc atctaccgcg cggactgcaa 16680 
gctgccttct gcggagcacg cctgccgcag aacactcctg ctaccactgc acccaagcct 16740 
tgacgacgcc gaggtgcgca cggtggctga cgagttccag aaggccgtcg aacaccacat 16800 
cagccaaaga tcaccactcc gaaagtgagg atgtcgcgcg tgagcgacac attcgcagaa 16860 
acctcctcgg tatacagccc agatcatgcc gacatctacg acgcgatcca ctccgcgcgt 16920 
ggccgggact gggcagccga ggccggggaa gtagtccagc tcgtacgcac caggctgccc 16980 
gaagcacagt ccctactcga cgtcgcctgt gggaccgggg cgcacctaga gcgattccgt 17040 
gccgaatacg . cgaaggtcgc ggggcttgaa ctgtccgatg cgatgcggga gatcgcgatc 17100 
agacgagtcc ctgaggtacc gattcacatc ggtgacatcc gcgatttcga cctcggcgag 17160 
ccattcgacg tcatcacctg cctgtgcttt accgcggctt acatgcggac cgttgacgac 17220 
ctgcgacgcg tgacgcggaa catggcccgg cacctggccc ctggtggagt cgcggtcatc 17280 
gaaccctggt ggtttcccga caagttcatc gacgggttcg tcaccggagc cgtcgcgcac 17340 
cacggcgagc gggtgatcag ccggctatcg cactcggtcc tggagggccg tacgagccgg 174 00 
atgaccgtcc gctacacagt cgccgaaccc accgggatcc gggacttcac agagttcgaa 17460 
atcctctcgc tgttcactga ggacgagtac accgccgcgc tcgaagacgc agggatccgc 17520 
gcggaatacc ttcctggagc accgaacggc cgaggcctgt tcgtcggaat ccgcaactga 17580 
gcccggaaca acgacgcaag acctggctgg gcaggccccg ggactcactc gaccgataag. 17640 
ccgacgtgac ccgggatcac ttcgatcgtc tctgatgccg gcaagttttg accgtcgtga 17700 
tctaggtgcc aaccaggtcg acgaagccct gggccctagc cgcgatcgtc ggcaaggtcg 17760 
ccgaggacga tcccgcctcg aatctgcgac cacctcgccg cagctgttcc ggcagacgtt 17820 
gtgatccgag ccgacgagaa ccactcggtt cagcccggca cctgcggctg ggaggtgttc 17880 
ggctcctgag cagccccaaa gcaccggcca tcggcgaaga actcgaccgg ctcccgcgcc 17 94 0 
ggatgcgcct acccgaccta cgcaaagccg cacccgccat gctggcaacc gcccacgcct 18000 
gacctaagcc gccggtgagc cacggctggc ccagtgtcgc caggcaaccg cggcgcaccg 18060 
gcggtaacgg cgcccgaagt acaggttgtc aatgaccgag acggcgaagg ccgacatccg 18120 
tacccacagg aacaggtgtt gttcggcgta cccggtcgca gcgagcctct ccgacttcga 18180 
ttgcaccatc gccgactgag cgcctcacca gggcacattc ccggcgtcgt cgaacatgcc 1824 0 
tccggtcggg ccgtcgtctg gcagcgtggc gagccgaatg gcgatcctgg caccgtctgc 18300 
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cggcgtgctg 



gttccgtgga 



agccgttaag gtcggtcgcg 



acgtagccgg 



ggcaggcgtt 



18360 



gttgattttg atgttggtat cgctgagttc. cttggcgtac tggatggtga tcgcgttgag 16420 
gaacgtcttc gacggtgagt aggctccgct gatcccgccg aggtcgacgc ccggcgtggt 184 80 
ttgcaaggtc agggaagcga cgtggctgga ctggttgacg atccgcgggc gctcggagcg 18540 
gcgtagcaac ggcagcatag cgttggtaac ccgaacgacg ccgatcacgt tggtctccac 18600 
caccgcccgg aggctcgccg gtgtgacggt cgagggctcc tccggccatg ccccggcgat 18660 
gccggcgtta ttaaccagca catcgagccg gccggcgcgc tcctcgagca gagccgcagc 18720 
agccgcgacg ctcgcgtcgt ctgtcacgtc cagggatacc gcgaacgcat cgacgccgtc 18780 
cgcacgcaat ttcgccacgg catcctcccc gcgctggtgg tcccgtgccc cgattccgac 18840 
gctccacccc agcgcgccga gcccggccgc gatctcgtat ccgattccct tgtttgcgcc 18900 
ggtgaccagt gcaatcgtct gttcgctcat gctttctagc gtggcccgaa accggcgggg 18960 
aaccaacacc atctaggtct cacaccgata cccagctgat atgaatcagc ggtaggctcg 19020 
acggatggag acgcgggagt tgcggtactt cgttgcagtc gccgaggagt tgcacttcgg 19080 
ccgggccgcc cagcgcctgg gcatcgccca gccgccgctg tcgcggacga tcgcccagct 19140 
cgagcaacga ctcggagtcg tgttgctgca acgcaccagc cgcaaagtct cgctcaccga 19200 
agccggggca atgctgctga ccgaaggccg ggcgatcctc ggcgcgctgg cagcagccga 19260 
gcgacgcacc cagcgtgccg cgacgagcca gccctcgcta gtcctggctg ccaaggccgg 19320 
cgcctccggt gagctgctgg cgaagttgct cgacgcgtac gccgccgagc cgggagccgt 19380 
ggccgtcgac ctgctgctct gcgaatccca gccccagaaa acgctgcatg acggccgggc 19440 
cgacgtggcg ctgttgcatc aacccttcga cccgacggcc gaactcgaca tcgaaattct 19500 
gaacaccgag caacaagtcg ccattcttcc gacctcgcat ccgcttgcca gcgagcccca 19560 
tgtacggatg gcggatgtca gctcactgcc ggatctcccg cttgcgcgct ggcccggccc 19620 
cgacggcgtc tatccagatg gccccggcgt ggaagtacgc aaccagacgc aactgttcca 19680 
aatgatcgca ctcggccgca ctaccgtggt catgcccgaa tccagtcgcg tcaacctgct 1974 0 
cgaaggcctc gccgccgtac cggttctaga cgcgccggac gtgacgacag tcatcgcctg 19800 
gccgccccac agccgctccc gagcactcgc cggcttggtc cgcgtggcca cactcctcta 19860 
aatcttcgct ccctcaattg cccgcagcta ggacgaatgc cgtgcgactg ccggaaacga 19920 
ccgcgcgcaa cacctcatgt gcctcagcgg gagaagacgc tatgcggatg ccgcagagac 19980 
ctcacggccg tcccgtcgcc aggagccagc ggtcccgttc gagctggtag gtttcggcga 20040 
tctgctccgt gacgcggtcg atgtagccga agctggttgc gagcattcgt tgcgtggtcg 20100 
ctgcggatac ggcctcgtcg gcgcattggc ggcggatctc gtcgaggcat cactacagga 20160 
agcggaagtg cccgatgcgg ttggccctga tcagcgcgtt gatggaaatg tttcgccagt 20220 
cgtcgcggat actccaacac cgcagcagga cccggcgaat atcccaacgg aatctcgaat 20280 
tcgagaacgt gcggcaatgt ggcgatattt tcctcaacac tcgtccgtag aatgctgagg 20340 
acgatctcat cggctcgcaa ctgcggagcc tcggtaatca ggtgcgtgta gacgtctcgg 20400 
ctgacttccg agacgcacga cgcgacgctc gatgccacgg ccgcctgtag cttcgaggaa 2 04 60 
tcgaagctgt ccgtggccat gacacgacgg tactgcggcg ggagcagaag cagcgcttcc 20520 
tgcgccgatg gcgcgcctgg agaagatcag caccttcggc ctgaccgagc cgggcgcacg 20580 
gcaccgacgt ggtccagctg gagctcggag acacagcgtg gcggtgttca gcgccgccag 20640 
gtcccgcagc cagcgctttg gcgtgcggcg cttcgggaga gctgggccat tacaggttgt 20700 
gcgagagcag ctggctcagc acctcgtcga cgttcatcca ccggtgttca tcaccaggtt 20760 
cgaccacgtc gtaggtgtcg ttcaagaagt ccgcaagctg gtcagcattc acctcgaagg 20820 
aggcctgccc cgacggcgag ctcatctcga tcacgaccaa ccccggaaaa ccccgtcgag 20880 
ggccgatccg cacatcgcct tcgcctgcct cggcgatcag cccgtcggcc agcaggtcgc 20940 
gggcgatcac ccagtccacc tgaccgtccg tgcctacgtt gagcttcatg gagatctcgt 21000 
acggattgcg gctgtcgtac cgcagctcca cgccgactgg cgcgatcaca cccgccgggg 21060 
ctaccaggtt gaagatcatg gtggcgagca ccgtgccgtg atcgtttgcc atgatgtcca 21120 
cgctcccaag ccttggcttc ccataccctc ataacgaagc atgcccaacc cgggttcata 21180 
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ccttcttcac gaagtcttca gaccctggtg gcagtccccc tcgggctccc agcgaggaac 21240 
cttcactcac actgaaccca cgggactctg ggcccggaac tcccgaagct gacccaaggt 21300 
gcaaagcaac cggcgagtag cttttcgctg cgccagtcaa tcaatgacgg ctccgggaat 21360 
gtctcgtgac catgctaggg cgtactcgcc gaaaaatggg atggatacgg gctgaccagc 21420 
aggaacgcgg tcagacggtc tggtcctcgg gaaagaatcg ggcgggtttg tctggcgcgg 21480 
gcgtcaggac tggtccaagg cccgttcgat cgcgcggcgg gcccggccgg cggccgcccg 21540 
gtcgtgctgg agctgcatgg ccgcgt-tcag tgccagcccg gtggcgacga tctcgaacag 21600 
cgcctggtcg atgtcgaacc cggcgggcag ctcgccgttc tccaccgccg cggtcaggtc 21660 
cgcccgcagc tgtccccgcc agcgcgacca cacctcggcc accgcgtcgc ggacccggcc 21720 
ggsgcggccg tcgtactcgg tgagcgccgc ggtcatcagg cagccgccgg gcagcagtgg 21780 
cgcttctagg tagcccacgg cgttggcgca caccgcgcgc agccgccgca gacccggcgg 21840 
ctcggccagt gcgggctcga ccacccggtg ccagaagtcc acgaatgcct tgtccagcgt 21900 
ggagatctgc agtgtctcct tggtgccgaa gtgcttgtgc accccggact tgctcatctc 21960 
cagttcctcg gcgagccggc cgatggtgat gccgtccagc ccttcctcgg atgcgatctc 22020 
ggcggcacgg tcgaggatcc ggctcctggt ggcctgtgct tcggccgctg agcgtcgcgg 22080 
tgacatgtga cgaggatagc gtacgcgcgt tcgctattga tttagtgaac gtgcgtacgc 22140 
taaattccgc ttcacccacg gaccgaagga gtgcgagatg cgggtgcgac gactgggctg 22200 
ggccggactg gaaatcgagg cgggcggcaa gcgactggtg atcgactacg tacgggacct 22260 
ctcaccgctg ttcacggcgt ggaaacccgg cgagaggctg gcggtgccga gcgggacggt 22320 
caccgccgca ctggtcaccc acctgcaccg ggaccacacc gacgcggccg cgctcgcggc 22380 
cgcgctgaca ccgggggcac cggtgctccg acccgcgccc ggccacggcg acgacgtgga 22440 
caacgtgacg acactgccgg ccgagcgcga gctgacactg caccgactgg ctgccgaggt 22500 
cgtggatgcc tggtccaccc gcgacctcgg gccgttccgc gtcaccgcgg tccccgccgt 22560 
cgacgggctg ggcgacccgc agctgaactg ggtggtgcag gccgacgggc agcgggtctt 22620 
ccacggcggc gacacgatgt tccacggcta ctggtggctc atcgcgcgcc ggttcagccc 22680 
gttcgacgcg gtgttcctgc ccgccaacgg cgcggtggtc gacgcgccac acctccagcc 22740 
gccgagcccg ctgcccgccg cgatggaccc gaggcaggcc gccgcagccg cagagatcct 22 800' 
cgacgcccgg tacgcagtgc cgatgcacta cgaggcggaa cagccggaca agatcgcggg 22860 
ctacgtcgag gtctccgacc cggagaatga gttccgcacg cacgccggac accgcgcaca 22920 
cgtgctgccc atcgaggaat ggctggacct ggccacttga ggcggaaact ccgccgcgcc 22 980 
ggcgggctca ggtcgcaaca gaacctaggt ccgatggatg gtccggaatc gcgcgggaat 23040 
cggttgctcc aagacactct ccggtcgcgg cggcgtcaat cacaagattc agcatccatt 23100 
actgctggaa accacgatca aatgcgacac tccgggttga gccgaaccca cgggtccagt 23160 
cctggtcctg ggcgggacgg acccttgggt tcccgccgag gaaccctcac ttgcgctgaa 23220 
cccacggcgg ccgaagcgtc ccggtagcag acgccggaaa ccagcgctta gctctggaac 23280 
gctggctgcg gaccgggccc agaggcgagc gctcggtggg tgctgagttc ggggacgtac 23340 
ttgtagatcg tcgagcggaa gacgcaccgc acgcttgagc ttgaacggcg cagcacgcga 234 00 
cccgcccgag ataccggtta gcgtggtcag atcaacactg cccgttgtgc accattagcc 23460 
gcatatgtcc ggttaccatc gattcgatgc ggatccggta cgtgtcaaag ctctctacgt 23520 
cgacgggtac ctgaatgccg cgctcaacga cgcgtcatcc gggtcagccc ttctcgagca 23580 
ggcgagcgcg ctggcggcgg agttgggcga ccaatcaggg gcggcctacg tcgcgcagat 23640 
ctcggggctc accgccctgt tcaacgaaga tccggcgcgg gcgagcgcgc tcttcgaaga 23700 
cgctctcgcc aagcatcggc tgattggcga tcacgctgcc gccgcctacg atcagatcga 23760 
gctcgcgttg gcagtcgcgt tccttggtga tcatgagcgc gccgctgccc tgttccgcgg 23 820 
gtgccctgcc gtcaaggaga cctacagcga acagtggatg agatcgcttg ccctgtgggc 23 880 
gaagggaata gcagacctac tagcgggcaa ttaccggcaa- gccaatgctg cggaactgga 23940 
gagtcttcgc ctgcggctcg gcttccatga gcagtttcag atcgcgttgt gcgttgagat 24 000 
tctggcctgc atagccagcg cagacggtga cccggcacgg gctgcgacct tgttcggcgt 24 060 
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gtcccaaacg atcaaaaaga acgtcgatgc 
cctacacgac cactacgagg cagtagcacg 
aggcgttcac cacgccacac agctcgactt 
gggcaaagcc ggggaaccag eaagctccag 
tccgcgcgag cgagaggtag cagcgctggt 
cgccatcatg gtgatctccc gccgcaccgc 
actcggcttc accactcgga cgcaaatcgc 
acgatgcgac tgacaaccgc cggttttcgg 
ctgcgagacg gcccctctcc acagcctctt 
cgcttgagga cggcatccga tgcccaccgc 
tgatcgagaa cgctgccgtg gtgacgatgg 
acatcctcgt cgacaacggt gtcatcgtcg 
ccgagcgaat cgacgcctcg tcgatgatcg 
acacctggca aaccgggctc cggggcattc 
ggggattccg gctgcagatg gcgacgaagt 
acctcggtgg cctcgattgc cttaactcgg 
acatcgtcac cggcgagcac gcgcacgccg 
gcgcgctgta cggccacggc ctcctgccca 
gcgggctgga cgagtcggtg gaagaaggcg 
aaatccgcgc ccagtacttc ccgagcgagc 
aggaactagc catcgctccc ttcggggacg 
tcggcgcacg catcacgttc cactgcaacc 
tcgaggcgct gcactcccac aacatgctcg 
tcagcactga gaacgaatgg cacctgctgc 
ccgagaccga gatgcagatg ggcatgggat 
cgccggggcc aagcctcggc atcgactgca 
acgcacggct cgtcctacag gtgacccgat 
cgacactgcc cacggtcatg cggtggaaaa 
acggtgcccg cgccgcgggc gtcgacgacg 
ccgacatcgt attgctggac atgtcaggca 
cttgtggcgc gatcattgcg cagaccaact 
ggcgggtggt caagcggaac ggccggctgg 
cgctcgcgga gtcgcacggt catctctacg 
cgcaacctcc agccgagcta ccggtgttca 
cgcgccattc gctgcgtgag gtgacccggc 
gctgctgtgt gacctgcacg gacgaggtta 
cggtcgctca tcatgccgcg cccaccgtca 
cggccgggcc cctggaatag ccgcttccac 
cgatgatcgt ccccgtgaac gcgtggggct 
gaaggcgggc tgcgggaacc ggcgttgact 
ccacatcaat ggggagggcc cggtgacgac 
gtccgaacgc accgtgaccg ttgaacggtc 
agcaactcga cgaacgcgcc ggcgatcttg 
gacccggctc aacacatcaa cccagccccg 
cgtggcaatc gccctaaagg agggccagct 
tgtgaagcgg gcgtgagtga gttcatagag 
acatggagaa cggcggaccc agccggacag 
accagatcgc tgaccgacca cagatcttca 



gtcgctcgcc ggccacaaac atgttgcgcg 24120 
tagctcactc ggcgacgaga cgtttcagaa 24180 
cgacgaggcc atcgcatgga tacggaatcc 24240 
gacggaagaa acgcacgcgt cggtgcttac 243 00 
cgcacagggc accaccaaca aagaaatcgc 24360 
tgaggcccac gtcgagcaca tcctgaccaa 24420 
gacctgggtc attcaacaga agaccgcgcg 244 80 
attggcgtgg accagtagaa gctgccccgt 2454 0 
ctccacagtc gatcaacgac cgaacgggac 24600 
gcggagagga gcacttggcc atgcgcacac 24660 
acgacgttct cggcgacttc gaacgagccg 24720 
agatcgggcc gaacctcaac gttggcgaca 24780 
ccatgccggg catggtcgac acccatcgac 24 840 
tggcggacgg gaacatcctc gactacctga 24900 
accgcccgca ggacatgtat gcgggcaact 24 960 
gggtgaccac gctggtcgat tactgccaca 25020 
cggtcgctgg actccgcgac gcgggcgtgc 25080 
tcacgtcgaa cacgtggtcc gagaccaaag 25140 
acttcgcccg gcgggcgcgg ctggcccggg 2 5200 
aacaattgct gcgcttcggc atcgccccgc 25260 
tcaagcagga attcgagctg gcccgcgagc 25320 
aagtcatcgc gcgcaacctg ttcaaggaca 253 80 
gcagcgacct tctcttggtg cacgggacat 25440 
ggggtaccgg caccatgatc tcggtttgcg 25500 
tcccggtcat tcgtgaggcc accgagaaca 25560 
cgagcagtac ctgcggcgac atgatttcgc 25620 
ggagggacga tcaagaagac tacgcgcggt 25680 
cgcgcgacgc cttacggtgg ctcacggtca 25740 
tgacaggaac cctggcaccg ggcaagcggg 25800 
tcagccaggc cggttggaac cgccacgacc 25860 
cgggcaacgt gcacacggtg ctggtcgatg 25920 
tgcacgtcga cgtcaacgga gcgcttgcaa 25980 
accaaatggc ccagcacggc ggattcatcc 26040 
accgctgaga accaagccag ctgtggctgg 26100 
tggccgggct cgaccggaaa accgtcagac 26160 
tcggcatagg caaggccagt cgcagcagca 26220 
gccgcgtgcg agcgccgatt cgagcaacca 2 62 80 
ctgcacggag caggtattcg cctcgtggac 26340 
gccgtgcggg aacgcgccgt cttccggggg 264 00 
gaccactcgt gcgcattcga gcatcagcgg 26460 
aaagaccgtc aggtgtggtg gcaggacgcg 26520 
ggccgagctg tcgtttcttg ccgcagcaac 26580 
ccgcgccggg gccaccaacc gcgaactcat 26640 
ataacagggt tcgagccctg cctgtcgact 2 6700 
gtggttacgg atcgtgttcg cgtgctcctt 26760 
cccgcgccgc gttcgtgagt aagttcttga 2682 0 
cgctcgctac cgcgtacgcc cgcgcctatc 26880 
ccgaccccct ggcggtacgt ctgctcggcg 2 694 0 
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tcaccgcaga ggaactgacc aaattggcca cgcccacaac cgatcacctc ggcgccgggg 27000 
caatcgatca gccccgccgc ctgttcttcg ccgcccgtgc ccgcttcgcc gaggacgccg 27060 
tggccgcggc catcgccgat ggtgtacggc aggtcgtgat cctcggcgcg ggcctggaca 27120 
ccttcgccta tcgcaatccg cgcccggact tgcgcgtctt cgaggtcgac caccccgcca 27180 
cccaagcatg gaaacgcgaa cgccttgcta gcgcagggat cgaccgccct gagacgttga 27240 
ccttcgtacc ggtcgatttc gaaacccagc cgctggcaac tggattggaa tccgccggat 27300 
tcaaacggac ggacccggca gtattcgtgt ggctcggcgt tgtcttctat ctgaccgcga 27360 
acgccgccca .cgccaccctc gaatacatcg ctggtcaagc ccagccggtc gaggtggtct 27420 
tcgactacct gcagcctgcg gccaccgacg aggaccgtgc gcaccagcgg gcacgtgccg 27480 
atcggctggc cggcgccggc gaacccttgt tcagctactt cacaaccgac gacatcgccg 27540 
cacagcttca cgccctcggg ttcaccggca tcgaagacca ctccgccccg gacctcatcg 2 7600 
ccggctacct ccacggatcc gcaggtttcg agggcgaacc accccgcgca ctgcgcgcga 27660 
.gccgcatact gcacgcgagc cgctgaggtt tgcctgccgc gcctacttgg accacatctg 27720 
tcgcgacggc tagcatctcc cacatcgtcc tctgaactgc acaaactcaa aaaatcgcgg 27780 
gagcgcacgc gggtagggccr ccggcgaaca agccctagcg atacatcact ttgaacaagg 27840 
tcgcgtctcg ataggctgcc aatgcgccgt tgcgtttcgc gcgaaccgcg cggtcgactt 27900 
ggtcgtggcg tagtcaagga accgctgcag ctcatcgcgg gtgaacgggc ggtcgtccgg 27960 
atcgccctca taggactgga gatgggcgag tgagttccac tcgtgcacat tcgccaccgg 28020 
gtactcgtcc ggcccgaacg acgcctcgca cgccagcgcc cagccatacc ggctgtcaca 28080 
gaggaattcg ttgaacagcc tgagactgcc ttggtaggcg cggattgtgg atggcgcgag 28140 
atgccgttct ccggtcggtg attgcgacca ttcgtccaca tgcgacggtg tccaccgcca 28200 
cggatactcg ttggtgaact ccgcgaaccg gcgcaccagc cgctcccgca ccccggccag 28260 
gtgcgggaac attgccttga tcaaaccgca gtccagacga tcatgatcaa cgagtgtgat 28320 
caccagcagt cactggagcc ggcgttgcgt ccacacggtt ggccaacccg agcaggtggg 28380 
tgcacagcaa cgttgttgaa gcacaacgcc tgactggttc aggtccggcg gcagcccgta 28440. 
ttccagccgt ctcggcgcgg ggacgttcgt gttcagagat aaaatcgtgt tgcccggttg 28500 
gtcggtttgg gcggttttga gtccgtgcga ggccgattat gcggcatggg gc.ctgtgttg 28560 
aggaggtgcc gtggttacta cgcagcggaa accggcgggt gctgggggaa cgacggtgag 28620 
tctgccgggc gtaacggccg agttccaccg tcccgatcac tacattccca ctcgtgatga 28680 
tcttgctagc gcggctgagg ccgttcgggg cgttctcccg gaccatcaga cgatggcctt 28740 
ctatcgtggt cttggagcgc tgaccctgtt ggggttgatg gactggccgg tggcggcggc 28800 
gattggcgtg ggtgtcgcgg tcgcgcagcg tgccggcgga ccggcgtcga agtgatccgg 28860 
tttgtagcgg gttccgggct aggcgtcgct gagcgcgccg cggcggcgct caggccaggt 28920 
gaacctgggt aggtgttgcg ttgcctgcgt ccggcgtagc tgcatgctcg gagccgtttc 28980 
gaagaacccg agcacacagg cggccttgaa ttcaccaccg ctgggactac tggtgtgtcg 29040 
gcgggtcgaa cagcgtggcg aagccgtcgg tgttgctgat cagctcgcag gccaccgaga 29100 
ggacggatgt gcggctgttt cgaggttcgc tttccgctcg tcggtcctac gctcgtgact 29160 
cacggtatct gtgcgcgcgg gaggtagtgg tggctaggac ggcgcggaac acaacgcctg 29220 
gcgcaggcgc gcggcaacaa ggggcgcggt- tgcgagccgc ggcggtcagc ctg-cccgtcg 29280 
tggcacagtt tcgtcgggtc gatgttcgca tgccgggtcg ttgggaggac gcggttcgcg 29340 
cggtacgggt cggttggggg ctcgaatggc cggttgcggg ggcgaccggc gcgggggtgt 29400 
cacgggcgtc ggtcggcttt gcgtcggaca gctgagtgtg ttcgatgctg cgtaccttgg 29460 
gcgcgttggc ctcgcagatc gtggcgccgg tggtggctgt cgcgcagagc atcgcgggct 29520 
tggggcgcgg gcggcgggcg tggtcgtcgg aggggcgtgc gtacgtcgag gtgcgcggtg 29580 
ttgaccagcc gggggctgag gaggctggcg agcagatcgc ccggcggttg gagaaggtgc 2964 0 
cgggtgttgc gtgggtcgag atcaacgctg tcgtggggcg ggtggtcatc gggcacgatc 2 9700 
cgggcctggt tggggtcgct gagctggtgg gtgtggtgga ggacgcggag gcggacaccg 29760 
gtttggacga ggaaccgttc gctgcggtcg gtgtgagcca cccggacaat ccggtcaagg 29820 
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tgatcgctga ggcgacggtg ctgggtgcga 
cccggatgct gccgctgccc gtgctgccgc 
agtcggcctc ctgggtccgg gggccgctgg 
tgttgttcgg gtttagcagt gcgttgggga 
tggcggaatc ggggtaccgg ttctgcctgt 
gggcgcggtg ggaagccgga acgggggcgc 
gggttcggcc gcgtcctgtg ccgttgcccg 
cggctctggg tgggccggcg gcgttcgcgg 
gtgcgcaggc gatggtggtc gcgggcgcgc 
tcgcggcgca gctcgggtac gggctgtcgg 
tgttgcggcg gttggaccgg gtcgacacgg 
gtgctcggtt ggtgcatgac gtgattcccg 
gggaacgtgc gcatgaactc gtgggtgtga 
ggtcgctgta ctcggtcgct cggtcgcggt 
ggggtatcga ggtgttcgcg ctgcgccacg 
ccgaggagct ggatccgctc gctgaggagt 
tggttctcgc gggtgatggg ggccggttgg 
cgggtggggt gcggctggct tcggtggtgc 
gtgtggtgtc cgcgcgggcg cgggcggcac 
cgcgtggtgg tgaggttccg tggggtgcgc 
cgtgtgcgtt gttgagtacg gtcccgaccg 
tgtcgttgat cggttcggtc agcggggcgg 
gccaggtccg tgcgggcgtt ccggtggccg 
cgtggtgggg gatgcaggcc gcgtggcgtc 
ggcatgcgat gggccccggc atggtgctgg 
ccgatgatgc ggcgcacgag cgtttccaag 
tgggtgtgtt gcgggcgtcg ctggaggagc 
gtggggccgc gatctcggcg agtatcggtg 
tgctggggat gaacgcgttg atcggcggcg 
acaggcttct ggaggccagc gttgttcggg 
agttgccggc tgacgagctc gtgctcggcg 
ttccggccga ctgccggctt ctggaggccg 
ccggggagtc gatgttggtc accaagacgg 
accggacgtg catgatctac cgggggaccg 
tggtcgcgac cggcgcgaac accgaggcag 
gtgccgatgg tgtcgcggct cgcctgcaga 
tgggagcggg tgggattctg ctcgtcacgg 
cgctggggcg ggcggtgagc ctggcggtgg 
ccaccgtcgc cgaactcgcg gccgcccggc 
actccgcgac gatcgaggct ttggggcggg 
ccctcaccga agggcgcatc gcgctgcggc 
tcgaggaaat ctccacagtg gaacggcgag 
tggtggagaa cggagaactg ctcccgcatc 
gtggcgcggg agtgcacccg gccgaaggca 
ccttcgagcc ttcccgtggc taccacgcgg 
tgagtgtgaa aggtgccccg gagatcgtgc 
atggcgacgt gccgttggag gagaccgctc 
tcgcccggca gggctaccgg gtgctggctg 



acttgaccgg cgctgtgatc gcggtggggg 29880 

cgtctgttcc ggcgctggtg tcgaccggtg 29940 

aaacgcggat cgggaagacc gccactgacc 30000 

acacgcttgc gcggcgtccg gcgagtttgc 30060 

tgcgggagac gcaggcggcg cggcggagct 30120 

atcctgccag tcaccgtagt gcgccggtgg 30180 

ccggcccggt cgagcgcgcc gcgaacggtt 30240 

gcatgctcgg ggtgacgcgg agttttcagc 30300 

cgcgcgcggc gcaggtgggg cggggcgcgt 30360 

agcggacgac tctggtgttc gatgcccggg 30420 

tggtgatcga cgcggacgtt ctgcgcaccg 3.0480 

cggacggggg gcgcgcggtc gtcgagttgt 30540 

ccaagggggt cggcgcgcag cgtgatggtt 3 0600 

tgcccccaga cgtgcgcgag gtggccggtc 30660 

attcgcgcac ggccgcgtgg gttctggctg 3072 0 

tggtcgcggc agctcgcgag gtcgggtcgg 30780 

cggggcggat cggtgttgat cgcgtggtcc 30840 

gggagatgca gcgggagggc agcgtggtcg 30900 

tggcggctgc ggatgtcggg cttggtgtga 30960 

acctgatctc gggcgcgagc ctgggggagt 31020 

cgcgttatgt cagcgtgcgt agcgcgcagc 31080 

tgttcggcgc gttcgggcct gccatcggga 3114 0 

tgtgtgcggc aatcgcgctc ggcgctggga 31200 

cggttccgcg cgcggtgccg cgcactgcct 31260 

agctgctcgg cagttcatcg cagggtctga 31320 

gtcaggacga ggggacgggc cttcgccgga 31380 

tggcgagccc gttgacgccg gcgttggcgg 3144 0 

ccgttgcgga cgcgacgatc atcatgggcg 31500 

tgcagcgcat caccgcggac cgcgctttga 31560 

ttgtggtgcg ccgtaaggaa aatgtgacgc 31620 

atgtggtgga gttggtggcc ggggatgctg 31680 

acgggctgga ggtcgatgag tcgagcctga 3174 0 

tgacggcgac tccggcggtg ccggtcgcgg 31800 

cggtcgctgc cgggcgggcg gtgggtgtcg 31860 

gcagcgcgat cgggcccgat gcccggcgag 31920 

cgctggccaa ggtgacggtg ccgatctgcg 31980 

acatgctgcg taggcgtcca ttggggacat 32040 

cggccgttcc cgagggattg ccgttcgttg 32100 

ggctgtcgaa acgcggggcg ctggtgcgta 32160 

cgcaggtgtt gtgtttcgac aagaccggca 32220 

aggtatccga tggcgcgtcg agccggtggc 322 80 

tgctcgcggg tgccttgcgg gcaagtccgt 32340 

ccaccgatcg cgcggtcgtc gatggcgcgc 32400 

agcagcgctg ggtaccgatc gacgagatgc 32460 

tgctcggcac ctggcgcgac ggcaatctgc 32520 

tggcccagtg 1 cacccggtgg cgcagatcag 32580 

gtcgaggcgt cgagcgagtt gtcgaaaacc 32640 

tcgcggagcg ggccgcctcg gaccgcagtg 32700 
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atctcgacga gtcccggatc cgggatctgc ggctgatcgg tttcgtcgcg ctggccgacc 32760 
cggtgcggcc cactgcggcc gccgccgtcg cgcagctcca gcaggcaggt gtcgaggtcg 32 820 
tgatggtcac cggcgaccac cccagcaccg ccgaggcgat cgccgccgaa ctcgacgcgc 32880 
tcaacgafcjcg tcgcgtcatg accggcaccg aactggacac cctcgacggc gaccaactgg 32 940 
cggcggaagt gaccaccgtc gcggtgttcg cgagggtcag cccggcgcag aaggcgcgca 33 000 
tcgtcgacgc cctgcaacgc aacgaccgtg tggtcgcggt gaccggggac ggcgccaacg 33 060 
acgccccggc gatccggttg gccgacatcg gcatcgcgct gggcgagcgc gcaacccccg 33120 
ccgcccgcga ggccgccgac ctcgtggtca ccgatgaccg catcgagacc atcgtcgacg 33180 
ccattgtgga aggccgcgcc atgtgggcat cggtgcgtga ctcgctggcc atcctgctcg 33240 
gcggcaacct cggcgagatc ggcttcaccg tcgtcaccgg tgtgatcaac ggcgggggca 33300 
gcctcaacgc ccgccagttg ctgctggtca acctgctcac cgacatcctc ccggcgatgg 33360 
cgatcgcggt gcgaccaccc ccgacgatca. ctgcggagat gctgctggca gaaggccccg 33420 
aagcctcgct gggcgctgcg ctgacccgcg acatctaccg ccgcgccgcg gtcaccgccg 33480 
gggccgccat ttgtgcgtgg atcctgggca gggcgaccgg aaccctgcgc caggccaaca 33540 
cggtcgcgct cgtggcgctg gtcgccgccc agctcggaca gaccctggtc gtgcgcggac 33600 
ggacaccact ggtcatggcg tgtgcgcttg tctcgttcgt ggcactggcc gctgtcgtcc 33660 
aaactcccgg cctgagccaa ttcttcggag ccagcccgct actcccacac ggctggatga 33720 
tcgccgctgg cgcaacgacc gccgccaccg tcatcagcct cttgctccaa caatggtgaa 33780 
cctccatgca tgagcgtcaa gatcccttca ccacagccat cgtcgggccc gactcgggac 33840 
aacttcgagc aggcaagacg ttggagcgcc gagagcggtg aacgactcgc cagcggcgaa 33 900 
agcgcctgcc cccagccgac gggcggcgga cccacaatgg gagctgcggc gacgtgcatg 33960 
cggtcggtca tcaggatctc tgtcgtgagc ccgtccggat agatgaccgg ccactgcggg 34 020 
taggactcgg cgatcactag gtcgtagtcg tcgacctgga gaccctgaag agccgcatcc 34080 
ggctcggctt gaacaatcac cacatccagt cctggacgcc gccggcgtag acgagtcagc 34140 
acctcgggga acaggatgaa ggtgacggtc tgcatggagg ccacacgcag acgttgcgag 34200 
atctcgccac gtgagcgatt gatctgagtg cgcagcgtct ccagctcagc gatcacccgg 34260 
tccgctgcct ccacgacgag ctcacctttc gccgtcaggc ggacgcgacg gccttcgcgc 34320 
accagtagcg gagcggatgc cgcccaccgt gccgccgcag ctggcccgcc gcataaccgc 343 80 
cagcagtggc ctgaccagtg ccaacatcgc ccacacgggc ccggtcggcg accttgatca 34440 
gtggtacagg ctgggcaggc cagcgggcaa cgcacgccct ccccgcaccc accacgatca 34500 
acatcggccg agatcgtaac gaaccccgtc ggtgggttcg aagtggttcg aacagttgtt 34560 
caagctgtaa gtgcgctgcc agccatgacc gggctacaat aaaacgctag ttctaccgag 34620 
gtgcggtgag tcgtgtctgg ggttgcggtt cgggaggctg gtcgcctccc atatgaggtg 34680 
acgagctttg tggggcgccg tcaggcggcc acaagagtta agcgcgcgct gtccagttca 34 74 0 
cggttagtga ccctgactgg tttcggcggg atcgggaagt cccggctggc gctgcacgtc 34800 
gcacacgacg tgcggcgggc cttctccgcc gggacatttc tggtggagct ggctaacgtt 34860 
cgcgatccca tgctggtgcc gcaggcggtc gcggctgctc ttgggatcca tgaccgttcg 34 920 
acacgtgccc cggaggccgt gctcgtcgat tacttggcgg acaagcaagt cttgctggtg 34980 
ttggacaact gcgagcacct gctgggggcc tgcagccgcc tagtcgctgt gctgctggcc 35040 
gcggccccgc ggttgcggat acttgcaacc agccgagaac atttagggat agtggcggaa 35100 
cagacctggc cggttccgcc actgtccgtg cctgctggca gcgattcatc aacgtcacaa 35160 
caggaaggtc gtccctacca gcgtgaagcg ttgaccttgt tcgagcagcg tgccgcggcg 35220 
gcgctaccag ggttcaccat caacgaagaa aacgaacaga tcgtggccag gttatgccgg 35280 
aggctggacg gcattcccct agcgatcgaa ctcgccgccg tgcgcgttcg cgtgctctca 35340 
gtcgggcaaa ttctaacgcg aatggagaac cgattccggc tgctcaccac cggcgaccgc 354 00 
gccgacgatc ctcggcatca gaccctgcgc gccgccgtcg actggagcta caacctgtgc 354 60 
aatgagcatg agaagcagtt gtgggcgcgc tgttcggttt tcaccgacga attcgatctc 35520 
gacgcagccg aaaacgtctg caccggcgac gggctgaccg atgatgacgt cttcgccgct 35580 
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ctcgcgggac tgatcgacaa gtccgtgctc 
taccggatgc tggagactat ccgtcaatac 
gaaaacgctc tgcgctgccg gcaccgcgac 
gccgagtccg cgggcccgca tcaagccggt 
aacttctggg cagcgttgga ctactgcttc 
cggatggtga gcgcgctgtg gttctactgg 
atgtggctgg acagggcgtt ggccctggag 
ctgtggatca ccggatggat tgcccaccgg 
cttaacgaaa gccgcgagct ggcccggaaa 
actcagtttc ttggcgaccg ggagatgtgg 
ctcgacgagg cactggcccg ccaccgcgca 
attttctcaa tacgagcgca ggcagcgggc 
cttctgcacg aatgccgcaa gatctgtaca 
gcggagtgga acatggcgat cacttggtgg 
tccgcgtcca ggtccctccg caataagcgg 
tgcgttgaac tgctggcctg ggtcgccgcc 
cttttcgggg cgttggataa aatgtgggaa 
acgctactga cctggcgtga gcaggccaag 
gaatacgaaa ccgcccgagg ccaggggacg 
gctcttgggg agaagccacc gccggtcgaa 
ccggttctca caaaacgaga acgcgaggtc 
aaagagattg cggctgacct ggtcatttcc 
atcctttcca agctgggctt tacctcgcgc 
agataacgta gtcacctgct ctgttggccc 
gatccgttcg cggcccggca acaccgcccg 
acatccgctt cagctaatcg tccgcggccc 
tccatgatcg gaggctggcc attgccgatg 
tagttgcggc cgcccacata ggacccgttg 
ggccagcttc gtaggccttg gctgttgccg 
gtctcccagc accatcgagc caccgttcgt 
gcacacgtcg atagcgggct cttgagccaa 
gagtccgcga ccagcgagcg ccccggcgac 
cgccacgtct gccgatgccc ggttgtcctt 
agctttcgtc atcgagcctg tcccacagtc 
gccgcccgtg cgaaggccaa cgaccagacc 
tcgccacgca cagcctcgat gaaccgtcga 
cccagatcga tcttcctgga cccggccgga 
atcgactgga tacccgtgcc gatgagcgct 
ggggcatgtc cggtcacgaa gctttccacg 
gcgcttccgg agattgtctc atcgattccc 
gccacgaccg ttagcggatc ggcattgagt 
gtgaggtcga gtgcgactcc ggcgcccgct 
gcgggcggca gtcggatcgc gttccggacg 
gcggcgattg catcccggat caaacgatgt 
gcgaacacta cgcctgcctt ctccgcggca 
agggcaagcg gtttttcgca caggacgtgc 
tcgtgatgtg ccgagttgat actcgagatg 
gccacggtga gttcgtcatg gccccgggcg 



acccggaccg aggagcgcac gaagacgaga 3564 0 
gggcacgacc ggctcaccga ggcgggtgcc 35700 
tactacctgc atctcgccga gcagtccgat 35760 
tggctcgcgc gtctgcgcgc cgagcgcggc ,35820 
accacacctg gcgaggcgtc ggttgggttg.,35880 
gctgctggcg actatctgag ggagggccgc 35940 
accgaaccca ccagccagcg tgcccgcgca 36000 
caaggggacc gagatggggc gctggtcctg 36060 
ctgggcgacg aaacggaatt tacctacgcg 36120 
gacaacaacc tgactcgcgc cacgcaactg 36180 
actggccact ggaccggccc cgccttagcc 36240 
ctgcttggag accttgaccg ggcgatggtc 36300 
gaacttggcg agcgctggag gctatcgtgg 36360 
gccgcgggcg acccaaccaa ggcagccgcc 36420 
gaactcaacg acctactcgg gatcagcgac 36480 
gcagaacgca atcctgaacg cgcggcgatc 36540 
ctgattggca ccccgctgtt cggatccgag 36600 
gcacgcgccc aggaagcact cacagacacc 3 6660 
cggatgaacc gggagcagac tatcttctat 36720 
gcggccgccg cttcggctcc ggaaaccgaa 36780 
gctgccctgg tcaccgctgg caaaacaaac 3 6840 
cagcgcacag ccgaggccca cgtcgagaac 36900 
gcacaggtca tcagttggat gaccgggcag 36960 
atgcccttct cgggccagcc cggctcacat 37020 
tcgtggcaag gtctgcgata ccgaaagcgg 37080 
ccgcaacggc gcatgttgaa gtaggtatgg 37140 
gccatgaccg cggagttcgg gtgctgcatg 37200 
gcgagcttac gcggtataga ttgagccacg 37260 
ccgacgagta gtgagtagcc ggtgtgaacg 37320 
ccggcttgta gtgcccgtgc gcgaagcctt 37380 
aaactcgtcc cgttccaaca cgagcacatc 37440 
gatcagtcag cctccaccgg acccgacgat 37500 
gtgcatgatc tgttgtgcct aactgcgccc 37560 
tgccgatgcc cggtccgagc agcctccagc 37620 
ccgtcttggc cggaggccgg gggctcggcc 37680 
acacccacga cgtagaggtc ctcgcgggta 37740 
ccgacgtgtg tcagcgtgcc gaccggtgac 37800 
ccttcggttc cgaaaacctc caggcatgtg 37860 
aatgaagcat gcacaccgct gacgaactcc 37920 
accctcgtca tgtggccgga cgaggtatgg 37980 
acgaaccgga gacagtccgc gtcgtgcacc 3804 0 
gccgaatcgg tcgtccgcca tctccgcgcg 38100 
ttcactgcga ccggctcgcc gatgtcgccg 38160 
ggcacggagt tgcgcatgtg gtggtttgtc 38220 
tcgacaagtt cttgtgcctc ggcgacggtc 38280 
ttgccggctg cgatcgccgc aagtgcttga 38340 
tacacggcgt cgacgtcagg atcgctaagc 38400 
ccgaatggtg ccgctacatc ctgcgctcgg 38460 
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gccaggtcgc cgctgacgac gctgaccacc gcggcgtcag gctgggcgga gatcgctggg 38520 
cctacaaatg cgcgggcgat gtcgctggcg cccacgagcg accagcggaa gggacggttc 38580 
gtcatgattc gatgggttcc tttcgggaat accgcgcctg tgcgacattt ccgctggcgc 38640 
acagacgtga ggccagcgcg ggtggccgcg aggattacac gaccatgaaa cgggtcgccg 38700 
cttgactgtt agcaagatcg gaggtttcgc ataccccgcc tgatctcgct tgcccacgag 38760 
ctgtcctggt agtgccaggt cgtcaggatt ttgtcagcgc tccttcccca cccggctgca 38820 
cgtctcccga gccggccttc cggatcgaac agcccgacat cgccgtcggc tggctggatg 38880 
ccgtagcgtc ggcgtagctg tttccaggga gtacgccggt gtttgcgctc ctgccaccgg 38940 
acgacctctt tccagagata cgatcgtagg tagctgaagg tcgcgttcgg gcatccatac 3 9000 
ctgaagtacg ccttccaggc agcatccggt tccggatagc tttcaatgtg gggaaaagct 39060 
gggtcgccgg cgacccgaaa accgacggca ccgctcgcgc gcagcgtgca ccctggaggc 39120 
gaggtccgtg ctgctgatcg tgatcccgct ggcgagtggg tagccctgga tgaccgtaga 39180 
tcagcgcacg taggtgatga gtttgggctc gggatcgctg acgtcagtga ccccgaggag 39240 
cggacatggc ttcgggttgc gctgatctgg agacaggatg tttgtcgggg ttacacgttc 39300 
tggagcagtc cgccatcgac acggacaacg gatccggtga tgtagctggc acgctcgctg 39360 
gcgagaaatg tgaccgcgtc ggcgaattcc tcgcgccgcc cgtagcggcc gaccggaata 39420 
gagctctctg attccctcgc gatgttctcc gcagtgcggt tctcccgtgc ggcctttgcc 39480 
tggtccaact gggcgatgcg cgcagtggcg atgcgccccg gaacgacgat gttgctggtg 39540 
atgccgtcgc ttgccacctc accggacagt gtcttggacc agcccaccag actcgagcgg 39600 
agggcgttcg acaacccgag • gttggggatg ggcgcgatga cgccggacga agtcgacgtg 39660 
atgatccggc cccatccccg ctcggtcatg cccgggagca cgcggtctgt gatcgcgacg 39720 
acggagagga tcatcgcctc gaagtgccga cgccacaggg tggcgtcctg cccggacaca 39780 
cgggtggscg gcgggccgcc ggtgatgttg accaggatct ccaccgggcc gagctcgctc 39840 
tcgacggcgg tgacgttctc atcgatgacc gacaggttcg ccaagtccca gaccaggctc 39900 
atgttatcgc cggtcatgga cgcctcgacc tccgtgagcg cgtccttgtc gatgtcggcc 39960 
agcgccacac gcgcgccttc ccggcttagc gaggcggcaa tggccgaacc gagtccgccg 40020 
ccggctccga gtacgagggc gactcttcca ttgattgcga gatccatgac atgtccttgc 40080 
ggtttcgggt gacggcggcc cggaaagtcg tggcacgccg ttgatggagg gacattggtg 40140 
ggaggagtgg cagtggcccg gtcagccgcc taccacggcc tgagcgctga cctcgaacag 40200 
gacaccgggt cggagaccgg cggtgatggt cgtccgggcc gggccgggct cgctgaagta 40260 
ctcgcggtag atcgcgttca tcgcgtcgaa gtcgtccccg tcagcgagga acacattcac 40320 
ggtgatcacg tctgcctcgc tgcagccggc tgaacgcagc gcggtcatga ggttctcgaa 40380 
tgcccggcgc acctgaggtt ccagaccacc cggtgcgagg gatctgtcgg cgaggtagcc 40440 
gcactgcccg gcgattgcca cgatgttgcc gatgcgtacg gcggcggagt aagggcctcc 40500 
cgcgggagga aggtcgacgt ctgtgaacgc ctgacgggac acggtgtcgg ggcctctctg 40560 
tggctggtgg tcatgtccaa ctgacttcgg gagcttgtgg ggtcggttca accggagcct 40620 
cccgtaggag gtcctgagcc ccgtcagaag aacgtgagga cgccgtcgac gacgttgaag 40680 
tcgtcgtcga tgatcgggat gaaggggagc ttgtcgaagg ccgtgcacgg gtgggacatg 40740 
tcgaaagcga cgatgtctcc gggagcgatc gcgagcccgg gggcaacgcg aacgaaggcg 40800 
tgctggtcca tcagcttgaa cgtctccgct tcccgcagcc gctgcctcga gccatccgcg 40860 
cgaaacagct cgcgtggctg tggcagcgtc aggtcatggg ccgcgtcgcg cttcccgatc 4 0920 
gacaggatcg cgagatcagg ctccggccgg gacagcacgg aagcccatgc agtgagtgcg 4 0980 
ttgtggagcc ggagcggttc ctgctcggcc cgtcggccag cgagcggcga gagcttttcg 41040 
tacttgccgg cgtcgtggga gatgtaacag ccgcttcgaa gcactaggtc cacgtcgtcg 41100 
cggacaccgt cccacgagcc gagtccggcg acgacacgat cgaagtacga gcttccgcct 41160 
gccgtcacga tgatgcggtt ggcgtcgaac aggtcgcgac gtgccagctc gagcacgatc 41220 
gaccggacgc cagcgaagaa ggtgtcgagt gcatggaggt cctccggtgt catgccgccg 41280 
gtcacgaggc cctcgtaggc ctcgacgccg accagccgga gatgctcggt cgcggcgacc 41340 



59 



WO 01/16303 



PCT/EP00/08013 



gccgcggcga cctcgag.cgc cgagtcgagg tctcgcacgc cacacctgcc accaggcact 41400 
ccgacctcga ggaggacgtt gagggtgacc ccgtggccgg agtcggcgat ggtatcggac 41460 
atcgcccgca ccgtgtccgc atcgtcgacc aggcagtaga agtcgaacga gctgtcgctg 41520 
gcgatttcgg cagtcaccca acgcagcgcg ggagcatcga cgagttcgtt ggccaggatg 41580 
atgcgctgaa ccccgtagcg gcgcatcacg gccacctgag tgggcgtcgc tgccgtgaac 41640 
gcccacgcac cgtgagacgc ctgaagggcg aagagctggg gtgacatggt ggtcttgccg 41700 
tgcggggcga gaagcgcgtt gttgcgctcg cagtacgcct ccatagtcgc caggttggcg 41760 
ttcatagcgc gtgcgtggag tgtggtcacc ggcagggcga ggtctccgcg ggccacgttc 41820 
cagttctgct ggtccacctc tccgaggagc acacgagcac ccaggggcac gcccttgagt 41880 
gcgtcgtcga ggacggtctc gcggaggtcg tcggcacatt gcaggagtcg ggtccggtcg 41940 
agctgctcga ccgcgtgagt gctcatcgtg tagtggcctt tcgcgccgtc gtggtcagct 42000 
gaccgatgcc gtcgatccgg cactccacac ggtctccgtc gccgatgacg agcgcgccag 42060 
gggttccggt ggagatgatg tctcccggaa acagcggcat catcttcgag tggaagctca 42120 
cgaggctggc gggcttgtgg gtcatgtggg cgaccgtgtt ggtcctgaac tcatcgccgt 42180 
tcttgaccgt cgagatctcg atgtcgtcca agctttcgaa cttctctatc acctcatcga 42240 
gcggtacgag ctccggtccg aagcagaaga acgtccggaa gttcttcgag cgggtgagat 42300 
agcgcggatt gatccgcaag atgtcctcgg cggtctggtc caggatggtc gtgacccccc 42360 
acacgtagtc catcgcctcg gactcgctga cgttctcgca agtcctgccg atgatgagcc 42420 
cgatctcagc ctcggacgtg gtgcgctcac tctgcgccgg aatcacgatc ggctcatccg 424 8 0 
ggccgatgat cgtgtggtcg cacttcacga aggacgccgg ctggtcgggc gccgactcgc 4254 0 
tcaggtccgc tgcgtgatcg ccgtagttca gaccgatgcc ccagatcttc cggggggatc 42600 
ggtacggcgc ggtgaaccgc acctcgtcca cgcgtgtgaa ggcgaggtcg ggggcgttct 42660 
tgacacgctc acgcagccgc gcgaggtggt ctcctgcgat cacgtccatg atggtcgtga 4272 0 
ccgccagctc ggggtctatc tcccggattg ctgcgacgcc gcggtcgctc acgacaacgg 42780 
ggatctctcc ttccgcagcg gtgatggtcg ccaggtgctc ggtcacgaaa gcccgtcctt 42840 
cggtcgaatg gtgaggtagg gggtggggga gttggtttaa ctcgttcccg ggggcacgtt 42900 
cccgcgctgc gtgtgtgagg agaggacaag cgacatcacg tggaggatgt ggtcggcgtt 42 960 
gttccgggct gcccggtcgg cgtcgccagc cagaacggcg gatgccgtcg cgatgtgctc 43020 
cgaggcgcag gagatggagt caccggtcga gggagtgtcg cgttctcgga gtacagcggt 43080 
cagcaccagg gcgttgtcga ccgcgcgctg gaggatctcg ttcccgcaga actcggcgat 43140 
gatgtggtgg aagacgcgat cctcggcgcg gaactgggcg gagtcgccgg actctgccgc 432 00 
gcgcagggat gaggtggcga cctcgtggag cctctcccgt tgggcgctgg tagcccgatg 43260 
tgcggtgaac caggctgccg agctctccag ccccgcccgg tgctcgtagg cgtcgcgaat 43320 
ggcgcggacc gaggcgctga tgacgcgcag ccccttcggg gcagcctcga ccaggcccac 43380 
gtgccgcaag cggagaagcg cctcgcgcac gggggtcttg ctgacattca gctcggccgc 43440 
gagcttggcc tcgctcacac gacgcccagg tgggagtgcc tgacttacga tgctgtcgcg 43500 
gatggcctcg aagacccgat ctgtcagatt gtcgacgcgt gcgtcgaggg gtgcgagcat 43560 
accgttgccg gaccccaccg cggagtctgc ggcggctggg cgatggtgca gctccaccat 43620 
gttcagcgtg cccgcagtgc gtggctgatt cggttccatc gtcccccttc cttggacagt 43680 
ggaatacggt atatggtcct caccgcgtaa gcaaccgccc gaggcgggac tccgtagcgg 43740 
atgaagtcct tgggatatgt cgtataccga gggagagctg atgaagcccg agtcgcaccc 43 800 
ggccttcgag agcctggatc gactggccac gatcgagatg cgtccgccga cgagcccggc 43860 
cggcatcgtg agcccgttct acacgctggc gcggggtgag cggtcggagc cgatcagcct 43920 
tgcgatcgcg cgctctctgc tgcgtcgaca gggtgaaaag gcgttgatcg tcacgggtct 43980 
ggtcgaggaa ggccggttcc cgcaggggga ggtggacggc cccatcggct ctctggcact 44 04 0 
cgcgcgcacg ctcgccggtc ttggtagcga ggtgaccatc gtcatcgacc cagaggccgt 44100 
cggcccggtg gagaagctgg tcgacgctgc gggcctggat ggcgtgacct tgatggagtc 44160 
ccgattctcc agcgcggccg aggcccggga gttcgcggcg cagttcggcg tcgtcgccgc 4422 0 
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catcgagaag ctggggcaga actctgttgg cggccgtcat ctgatctggg gcacgccggt 44280 
caccgccggt gatctgtttg ccgacgacta ccttcgcgcc gccacggaca acggtgctct 44340 
gacgctggcc gtgggagaca acggcaacga gatcgggttc ggcaacattg ccgccgagtc 44400 
cgaggcgctg actccgcgag -gcgtcagcgt tgagggcgga ttcttcgcgt ccacgacggt 44460 
ggatcatctg ctgcctgcat degtgtcgaa tttgggctgc tacgtcatca ccggtgcggt 44520 
ggccatcctg gctcgacgcg caaacctggc agtcaccggg gatctggtgc gcgagtggac 44580 
cgagctgggg ctgcgcgcag gtctacgcag cggcggcgtt gacgatcccg cgttccaagg 44640 
cgacgacggg atcccactgc gtttcgtcgc cgcgcatgcc gagctgatct ccggaatcgt 44700 
tcaccagagc ctgctcggcg acccctgggt ggcagaccag tgacggccac ccgtagcggg 44760 
caaccctcgg ttgccagcct. tcgtcatcat cgccgggaag gggaagtcat cgatggagtt 44820 
cgttgatctg cagcacgcgg agcccatcga ggcatggcgt gccaaccgcg cgaatgcgtg 44 8 80 
ggcgaggccc acgtccggga tctgtcatgg attcgtgcag gccaacctcg tgatgctgcc 44940 
ccaggatgac gcgttcgact tcatgcggtt cgcccagcgg aaccccaagc cgtgcccgat 45000 
cctcgaggtc accgacgcgg gagatccgga gccgaagctg accgcccctg gtgctgacct 45060 
ccgcaccgac ctgcccaaat acagcgtgta ccgggaaggg gcgctggtgg aggagcgcag 45120 
tgacgtcgtt gacctgtgga ccgacgacat ggtcgcgttc ctgcttggct gcagcttctc 45180 
cttcgagcac ttgctgctcg atgccggact cccgatccgg cacatcgacc aggggatcaa 45240 
cggcccggtt tacctcagca accgtgagtg tgtgcccgca ggccagttcc gggggaagct 45300 
agtggtttca atgcgtccca tgcccccgga ccgggtggcg gaggctgtca ccatcacgca 45360 
tgcgcatccc gaggtgcacg ggggccccgt gcacatcggc tcccctgaga agttgggcat 45420 
cgacaacgtt gatgaacctg actggggtga acgagtgcgt atggagccgg gcgacgtgcc 45480 
ggtgttctgg gcctgcggcg tgaccccgca gcaggcggcg atcgaggcga agaccgagct 45540 
aatgatcacg cactcgacgg gtcacatgtt cgtgaccgga atgcggatcg actccgagtg 45600 
aggcgggggc cccggggtgg tgcc 45624 



<210> 5 
<211> 50000 
<212> DNA 

<213> Saccharopolyspora spinosa 
<400> 5 

gggcgcgccg caattcgatg acgttcatgc gccgtgtcgg ggaatcgccg gtggcggcgc 60 
cagcagaggc tgaacttact ggtggtgtgt ccaggaatcg gaggggcagt accgaatgag 120 
cgaagccggg aacctgatag ccgtcatcgg actgtcctgc cgcctacccc aggcgcctga 180 
cccggcttcc ttctggcggt tgctgcgcac cggaacggac gccatcacca cggtcccgga 24 0 
agggcggtgg ggcgacccgt tgcctggtcg ggatgcgccc aagggcccgg aatggggtgg 300 
tttcctggct gatgtcgact gcttcgatcc cgagttcttc gggatctcgc cgcgagaagc 3 60 
ggcagccgtg gatccccagc agaggctggc tctggagctc gcctgggagg caotcgaaga 420 
cgccggtatc cccgccggcg agctgcgcgg tactgccgcc ggtgtgttca tgggggcgat 4 80 
ctctgacgac tacgccgccc tgctgcgcga gagcccgccg gaagtggctg cgcagtaccg 540 
cctcaccggc acccatcgaa gcctgatcgc caaccgcgtg tcctatgtgc tcggcctgcg 600 
cgggccaagc ctgacggtgg attcaggtca gtcctcgtcc ctggtcggcg tgcatctcgc 660 
cagcgagagc ctgcgacggg gtgagtgcac gatcgcactc gccggcggcg tgaacctcaa 720 
cctggccgcc gagagcaaca gcgctctgat ggacttcggc gcgctctccc cggacggtcg 7 80 
ctgcttcacc ttcgatgtgc gggcgaacgg ttacgtccgt ggtgagggcg gcggccttgt 840 
cgtgctgaag aaggccgatc aggcgcacgc cgatggcgac cggatctact gcctcatccg 900 
cggcagcgcg gtcaacaacg atgggggcgg tgccgggctc accgttccgg cggcggacgc 960 



61 



WO 01/16303 PCT/EPOO/08013 



ccaggcggag ctgctgcgcc aggcataccg 
gtatgtcgag ctccacggca gcgcgaccag 
cggagctgtc ctgggggcgg cgagacggcc 
gaccaacgtc ggccatctgg aagcagcggc 
cagcatctgg caccgcgaac tgccgccgag 
cccgctggac gaattgaacc tacgcgtcca 
ggggccgctg ctggccggcg tcagcgcctt 
gctctccggc acgtcccggg tggagcgacg 
gtgggtcttg tcggccagaa caccggtcgc 
gcacctcaat acggccggtc aaagtccgtt 
atccgcgctg ccgcaccggg ccgcgctggt 
gttgaaggcc ctcgctgacg gcgacgacgc 
cgagcgggca gcggtcttcg tctttcccgg 
gcagctgctc gaaacctccg aggttttcgc 
ggcgccgcac ctggattggt ccctgctgga 
ccttgaccac. gacgatgtcg tccagcccgc 
gctctggcgt tcgtggggcg tgcgtccggt 
cgcggcggcc tgcgtcgccg gggccctgtc 
gcgcagcagg cttctgacgg cgctggccgg 
cgccgaagag gtgcggcaaa tcctgttgcc 
gaacggaccg tcgtcgaccc tggtgtcagg 
cgagtgcgca gaccgagagc tccggatgcg 
gcctcacatc gaggttgtcc gggatgagct 
gacgggaagc atcccgatct attcgacgac 
cgccgactac tggtaccgca accttcgtca 
cctgttgaag cgggggtacg acgcattcat 
gaacatccag gaaaccgccg tgcgagcagg 
ccgcggcgaa ggtggcatgc- ggcaggcgct 
agtggccgcg gactggcacg cggtcttcgc 
gacgtacgcc tttcagcgac agcgctactg 
gcccgagagc gacgtgtcga cggcgttgcg 
cgtggactcg acgaccctca cgatgatccg 
cgatccgaaa gaggtggacc cggatcggac 
ggtggtcgag ctgtgcgacc gcctaaacgc 
cgttttcgac tgtcctacgc cggacaagct 
cgagccggct cccatgacgt cacaccggcc 
gatcgggatg ggctgtcggt ttccgggtgg 
ggtcgccgct gggcgggacg tcgtgtccga 
gcgtgcgggg acatcgcacg tgcgcgccgg 
ccccgggttc ttccggattt cgccgcgcga 
gctgctggaa atcgcctggg aagcagtcga 
cggaagtcaa accggggtct tcgtcggcgc 
cgaagcgtcc gaggaggcgg ccgggtacgt 
gggtcgggtt gcgtattcgt tcgggttcga 
ttcgtcgtcg ttggtggccc tgcatttggc 
tctggcgttg gccggtggtg tgaccgtgat 
gcggcagcgt ggtttggcgc cggatgggcg 
caccggctgg tccgagggtg ctggcctggt 



gaacgcgggc gtcgacccgg ccgccgtgca 1020 
ggtcggggat cccgtcgaag cagcagccct 1080 
cggcgacgag ctgcgtgtgg ggtcggcgaa 1140 
gggcgtcacc gggttgctga agaccgcact 1200 
tcttcatttc accgccccca acccggaaat 1260 
gcgtgatctg cggccgtggc cggagagcga 1320 
cggaatggga ggcacgaact gccacctggt 1380 
gcgcagtgga cccgctgagg cgaccatgcc 1440 
attgcgtgcg caggcggcgc gcttgcacac 1500 
ggacgtcgcc tactcactgg cgaccactcg 1560 
cgcggacgac gaaccgaaac tgctcgccgg 1620 
gcccacgctg tgccacggcg cgacttccgg 1680 
acagggcagc cagtggatcg ggatgggtag 174 0 
ggcgtcgatg tcggactgcg ccgacgcatt 1800 
tgtgctgcgc aacgcggccg gcgctgcgca 1860 
gctgttcgcc atcatggtct cgctcgcgga 1920 
ggcggtcgtc gggcactcgc agggggagat 1980 
cgtccgcgat gccgccaggg tggtggcggt 2040 
cagtggcgcg atggcctcgt tgcagcatcc 2100 
ctggcgcgat cggatcggcg tggcgggggt 2160 
ggaccgggag gcgatggcgg aactgctggc 2220 
ccggattccc gttgaatacg cctcccattc 2280 
gctggggctg ttggcgccgg tcgaacccag 2340 
gaccggggac ctgctggacc ggccgatgga 2400 
accggtgctg ttcgaagcgg ccgtcgaggc 2460 
cgagatcagc ccacacccgg tgctgactgc 2520 
gcgggaggta gtggcgctcg ggacactccg 2580 
gacgtcgctg gccagagcac acgtacacgg 264 0 
cggtaccggg gcgcagcggg tcgacctgcc 2700 
gctggacgcg aagcttcccg acgtcgccat 2760 
ggaaaagctg cggtcttcgc cgagggcgga 2820 
ggcacaggca gccgtggtcc tcggccactc 2 880 
gttcaaggac ctgggcttcg attcctcgat 2 940 
cgccacaggt ctgcgactcg caccgagcgt 3 000 
cgcccgccag gtacggacgt tgttgttggg 3 060 
ggactccgat gcggacgagc ctatcgccgt 3120 
ggtgtcctcg cccgaggagt tgtggcagtt 3180 
gttcccggct gaccgaggtt gggacctgga 324 0 
cgggttcttg catggcgccc cggattttga 3300 
ggcgttggcg atggatccac agcagcggtt 3360 
acgaggcggg atcaacccgc agcacctgca 3420 
gacctccctg gactacgggc cacgcctgca 3480 
gctcaccggc agcaccacga gtgtggcgtc 354 0 
gggccctgcg gtgacggtgg atacggcgtg 3600 
gtgtcagtcg ttgcgttcgg gtgagtgtga 3660 
ggccacgccg gggatgttcg tggagttttc 372 0 
gtgcaagtcg ttcgcggagg ccgccgacgg 3780 
tctactggag cggttgtcgg atgcccggcg 3 84 0 
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gaatgggcat gaggtgctgg cggttgttcg tggtagtgcg gtgaatcagg acggtgcgtc 3900 
gaatggtttg accgcgccga atggttcgtc gcagcagcgg gtgattgccc aggcattggc 3960 
gagtgcgggg ttgtcggtgt ccgatgtgga tgctgtggag gcgcatggga cgggcacgcg 4020 
gcttggtgat ccgatcgagg cgcaggcgct gatcgccacc tacggccagg gccggcttcc 4080 
ggaacggcca ttgtggttgg gctcgatgaa gtcgaacatc ggtcacgcgc aggcagctgc 4140 
ggggatagcc ggcgtcatga agatggtgat ggcgatgcgg cacgggca'gc taccgcgcac 4200 
gttgcacgtg gatgagccga cttctggggt ggattggtcg gcggggacgg ttcaactcct 42 60 
tacggagaac acgccctggc ccgggagtgg tcgtgttcgt cgggtggggg tgtcgtcgtt 432 0 
cgggatcagt ggtactaacg cgcacgtcat cctcgaacag cccccgggag tgccgagtca 43 80 
gtctgcgggg ccgggttcgg gctctgtcgt ggatgttccg gtggtgccgt ggatggtgtc 4440 
gggcaaaaca cccgaagcgc tatccgcgca ggcaacggcg ttgatgacct atctggacga 4500 
gcgacctgat gtctcctcgc tggatgttgg gtactcgctg gcgttgacac ggtcggcgct 4560 
ggatgagcga gcggtggtgc tggggtcgga ccgtgaaacg ttgttgtgcg gtgtgaaagc 4620 
gctgtctgcc ggtcatgagg cttctgggtt ggtgaccgga tctgtggggg ctgggggccg 4680 
catcgggttt gtgttttccg gtcagggtgg tcagtggctg gggatgggcc gggggcttta 474 0 
ccgggctttt ccggtgttcg ctgctgcctt tgacgaagct tgtgccgagc tggatgcgca 4800 
tctgggccag gaaatcgggg ttcgggaggt ggtgtccggt tcggatgcgc agttgctgga 4860 
tcggacgttg tgggcgcagt cgggtttgtt cgcgttgcag gtgggcttgc tgaagttgct 4920 
ggattcgtgg ggggttcggc cgagtgtggt gttggggcat tcggtgggcg agttggcggc 4980 
ggcgttcgcg gcgggtgtgg tgtcgttgtc gggtgcggct cggttggtgg cgggtcgtgc 5040 
ccggttgatg caggcgttgc cgtctggcgg tgggatgctg gcggtgcctg ctggtgagga 5100 
gctgttgtgg tcgttgttgg ccgatcaggg tgatcgtgtg gggatcgccg cggtcaacgc 5160 
tgcggggtcg gtggtgctct ctggtgatcg ggatgtgctc gatgaccttg ccggtcggct 5220 
ggacgggcaa gggatccggt cgaggtggtt gcgggtgtcg catgcgtttc attcgtatcg 5280 
gatggatccg atgctggcgg agttcgccga attggcacga accgtggatt accggcgttg 5340 
tgaagtgccg atcgtgtcga ccttgaccgg agacctcgat gacgctggca ggatgagcgg 54 00 
gcccgactac tgggtgcgtc aggtgcgaga gccggtccgc ttcgccgacg gtgtccaggc 54 60 
gctggtcgag cacgatgtgg ccaccgttgt cgagctcggt ccggacgggg cgttgtcggc 5520 
gctgatccag gaatgtgtcg ccgcatccga tcacgccggg cggctgagcg cggtcccggc 5580 
gatgcgcagg aaccaggacg aggcgcagaa ggtgatgacg gccctggcac acgtccacgt 5640 
acgtggtggt gcggtggact ggcggtcgtt cttcgccggt acaagggcga agcaaatcga 57 00 
gctgcccacc tacgccttcc aacgacagcg gtactggctg aacgcgctgc gtgaatcttc 5760 
cgccggcgac atgggcaggc gtgtcgaagc gaagttctgg ggcgccgtcg agcacgaaga 5820 
tgtggaatcg cttgcacgcg tattgggcat tgtggacgac ggcgctgctg tggattccct 58 80 
gagaagcgcc cttccggtgt tggccggttg gcagcgaacc cgcaccaccg agtccattat 5940 
ggatcagcgg tgttaccgaa ttggctggcg gcaggtagcc ggactcccgc cgatgggaac 6000 
tgttttcggt acctggctgg tcttcgcgcc tcatggctgg tccagcgaac cggaggtggt 6060 
ggactgcgtt acggcactgc gggcacgtgg tgcctcggtg gtgttggtgg aagctgatcc 6120 
cgacccgacc tccttcggcg accgggtacg aaccctgtgt tcgggccttc cggatcttgt 6180 
tggcgtgttg tcaatgttgt gcttggaaga atcggtcctt ccgggatttt ctgcggtgtc 6240 
acggggtttt gcgttgaccg tggagttggt gcgggttttg cgggcagctg gtgcgactgc 63 00 
ccggttgtgg ttgctgacgt gtggtggcgt gtcggtggga gatgtaccgg ttcgtccagc 6360 
gcaggccctg gcgtgggggt tggggcgtgt tgtggggttg gagcatccgg actggtgggg 6420 
cggcttgatc gatattccgg tcttgttcga cgaagacgct caagagcggt tgtcgattgt 6480 
gctggcaggt ctcgatgagg acgaggtcgc gatccgtcct gacggcatgt tcgcgcgtcg 6540 
gttggtacgc cacactgtct cagctgatgt gaagaaggcg tggcgcccca ggggatcggt 6600 
gctggtgacg ggcggcacgg gtggtttggg ggcgcacgtt gctcgctggc tggccgacgc 6660 
cggagccgaa catgtggcga tggtgagtcg acgcggcgag caggcaccga gtgctgagaa 6720 
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gttgcggacg gaactggagg atctgggtac 
cgatcgcgag gcgctcgccg aagtgctgaa 
ggtagtgcat gcggcaggcg tgatcgagac 
tttcgatcac gtgttgtccg caaaggtggc 
cgatgtggaa ttggacgcgt tcgtcttgtt 
gggacacggg gcttacgcag cggcgaatgc 
gtcgcgaggg ctggtcgcga ctgcggtggc 
ctccggagaa acaggagacc agctgcgccg 
cgccatcgcc ggaatccggc aggccgtgga 
tgtcgattgg gcacgtttca gcgcgggatt 
cgaactggcc gaggtcaagg aactcctcgt 
cgacgcgtcg ttggaatggc ggcagcgatt 
gctgatcctg gagctggtac gcggcgaaac 
ggccgttgca tcggaacgag ccttcaagga 
actccgcgtt cggctcaatc gagctaccgg 
ccatcccacg cctgcggaac tggctgcgga 
aggagcaggc attcccgagg aggacgaggc 
gttcccggcc ttgcgcgagg caggcttggt 
ggtcgactcc ggtatctcct cggacgatgc 
agccggcctc gtcgaagcag cgctgggcga 
atgacggtga ccaccagtta cgaagaagtt 
aacgaacgcc tccggcgcgg cagggatcgg 
atcgtggcga tgagttgtcg ttatcccggt 
ctggctgccg gcggtgtgga cgcgatctcc 
gatggcgtgt tcgttccgga ctccgatcgt 
tttcttcagg gcgtgtcgga gttcgacgcg 
ctggcgatgg atccgcagca gcggttgctg 
gctgggctgg agcagcggtc gacacgcggt 
ggccaggact acgcgtcgtg gttgcggacg 
acgggcggtg cggcagcggt tctttcgggc 
cctgcggtga cggtggatac ggcgtgttcg 
caagcactgc gggccggtga gtgcgacctt 
acgccgaagg tgttcctgga gttctcccgc 
aagtcgttcg cggcgggtgc ggatggcact 
ctggagcggt tgtcggatgc ccggcggaat 
agtgcggtga atcaggacgg tgcgtcgaat 
cagcgggtga ttacccaggc gttggcgagt 
gtggaggcgc atgggacggg cacgcggctt 
gccacctacg gccgtgatcg tgatcctggc 
aacatcggtc atacgcaagc ggcggcgggt 
atgcggcacg ggcagctgcc acgcacgttg 
tggtcggcgg ggacggttca actccttacg 
gttcgtcggg tgggggtgtc gtcgttcggg 
gaacagcccc cgggagtgcc gagtcagtct 
gttccggtgg tgccgtggat ggtgtcgggc 
acggcgttga tgacctatct ggacgagcga 
tcgctggcgt tgacacggtc ggcgctggat 
gaaacgttgt tgtgcggtgt gaaagcgctg 



ccgggtgtcg atcgtgtcat gcgatgtgac 6780 
agcccttccg gctgaaaacc cgttgaccgc 684 0 
tggtgatgcg gcggcaatga gcctggctga 6900 
cggtgccgcg aatctggatg ccttgttggc 6960 
ctcatcggtg tcaggagttt ggggcgctgg 7020 
ctatctggat gcgctcgcgg aacagcgtcg 7080 
ctgggggccg tgggccggcg agggcatggc 714 0 
atacggcctt tccccaatgg ctccgcagca 72 00 
acaggacgaa atttccctgg tagtggccga 7260 
gctggcggct aggccgcggc cgctgctgaa 732 0 
cgatgcccag cccgaggcgg gagtccttgc 7380 
gtccgcggca ccgaggccga cacaggaaca 7440 
cgctctggtg ctgggacacc ccggggcagc 7500 
cagcggattc gactcgcagg ccgcggtcga 7560 
cctccagttg ccatcgacaa ttatcttcag 7620 
gctgcgggcg aggcttcttc ccgagtccgc 7680 
gcgaatcaga gcggcactga cgtcgatccc 7740 
gagtccgctg ctcgcacttg ccggacaccc 7800 
ggccgcgacc tcgatcgatg cgatggatgt 7860 
acgcgagtcc tgagaccgcc gacctgggag 7920 
gtcgaggcac tgcgagcatc gctcaaggag 7980 
ttctccgcgg agaaggacga tcccatcgcg 8040 
caggtctcct cgccggagga cctgtggcaa 8100 
gaagttccgg gggatcgcgg atgggacctg 8160 
cctggcacgt cgtatgcctg cgcgggcggt 8220 
ggtttcttcg ggatttcgcc gcgtgaggcg 82 80 
ctggaagtcg cgtgggaggt cttcgagcgg 8340 
tcccgcgttg gcgtgttcgt cggcaccaat 84 00 
ccgccgcctg cggtggcagg tcatgtgctg 8460 
cgggttgcgt attcgttcgg gttcgagggt 8520 
tcgtcgttgg tggcgttgca cctggcgggg 8580 
gcccttgccg gtggcgtcac ggtgatgtcg 864 0 
caacggggtc tcgcgccgga tgggcggtgc 8700 
ggatggggtg agggtgccgg actgttgttg 8760 
gggcatgagg tgctggcggt tgttcgtggt 8820 
ggtttgaccg cgccgaatgg ttcgtcgcag 8880 
gcggggttgt cggtgtccga tgtggatgct 8940 
ggtgatccga tcgaggcgca ggcgctgatc 9000 
cggccgttgt ggttggggtc ggbcaagtcg 9060 
gtggctggtg tgatcaagat ggtgatggcg 9120 
cacgtggaat cgccgtcgcc ggaggtggat 9180 
gagaacacgc cctggcccag gagtggtcgt 9240 
atcagtggta ctaacgcgca cgtcatcctc 93 00 
gcggggccgg gttcgggttc tgtcgtggat 9360 
aaaacacccg aagcgctatc cgcgcaggca 9420 
cctgatgtct cctcgctgga tgttgggtac 94 8 0 
gagcgagcgg tggtgctggg gtcggaccgt 9540 
tctgccggtc atgaggcttc tgggttggtg 9600 
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accggatctg tgggggctgg gggccgcatc gggtttgtgt tttccggtca gggtggtcag 9660 
tggctgggga tgggccgggg gctttaccgg gcttttccgg tgttcgctgc tgcctttgac 9720 
gaagcttgtg ccgagctgga tgcacatctg ggccaggaaa tcggggttcg ggaggtggtg 9780 
tccggttcgg atgcgcagtt gctggatcgg acgttgtggg cgcagtcggg tttgttcgcg 9840 
ttgcaggtgg gcttgctgaa gttgctggat tcgtfgggggg ttcggccgag tgtggtgttg 9900 
gggcattcgg tgggcgagtt ggcggcggcg ttcgcggcgg gtgtggtgtc gttgtcgggt 9960 
gcggctcggt tggtggcggg tcgtgcccgg ttgatgcagg cgttgccgtc tggcggtggg 10020 
atgctggcgg tgcctgctgg tgaggagctg ttgtggtcgt tgttggccga tcagggtgat 10080 
cgtgtgggga tcgccgcggt caacgctgcg gggtcggtgg tgctctctgg tgatcgggat 10140 
gtgctcgatg accttgccgg tcggctggac gggcaaggga tccggtcgag gtggttgcgg 102 00 
gtgtcgcatg cgtttcattc gtatcggatg gatccgatgc tggcggagtt cgccgaattg 102 60 
gcacgaaccg tggattaccg gcgttgtgaa gtgccgatcg tgtcgacctt gaccggagac 10320 
ctcgatgacg ctggcaggat gagcgggccc gactactggg tgcgtcaggt gcgagagccg 103 80 
gtccgcttcg ccgacggtgt ccaggcgctg gtcgagcacg atgtggccac tgttgtcgag 10440 
ctcggtccgg acggggcgtt gtcggcgctg atccaggaat gtgtcgccgc atccgatcac 10500 
gccgggcggc tgagcgcggt cccggcgatg cgcaggaacc aggacgaggc gcagaaggtg 10560 
atgacggccc tggcacacgt ccacgtacgt ggtggtgcgg tggactggcg gtcgttcttc 10620 
gccggtacgg gagcgaaaca aatcgagctg cccacctacg ccttccaacg acagcggtac 10680 
tggctggtgc catcggattc cggtgatgtg acaggtgccg gtctggccgg ggcggagcat 10740 
ccgctgttgg gtgctgtggt gccggtcgcg ggtggtgacg aggtgttgct gaccggcagg 10800 
atttcggtgc ggacgcatcc gtggctggcc gaacaccggg tgctgggtga agtgatcgtt 10860 
gcgggcaccg cgttgctgga gatcgccttg cacgcggggg aacgtcttgg ttgtgaacgg 10920 
gtggaagagc tcaccctgga agcaccgctg gtcctgccgg agcgcggggc gatccaggtt. 10980 
cagctgcgag tgggcgcgcc cgagaattcc ggacgcaggc cgatggcgct gtattcacgc 11040 
cccgaagggg cggcggagca tgactggacg cggcacgcca cgggccggtt ggcgccaggc 11100 
cgcggcgagg cggctggaga cctggccgac tggccggctc ctggcgcgct gccggtcgac 11160 
ctcgacgaat tctatcggga cctcgcagag cttgggctgg agtacggccc gatcttccaa 11220 
gggctcaagg cggcctggcg gcaaggggac gaggtgtacg ccgaagccgc gctgccggga 112 80 
acggaagatt ctggtttcgg ggtgcatccg gcactgctgg acgcggctct gcacgcaacg 11340 
gctgtccgag acatggatga cgcacgcttg ccgttccagt gggaaggtgt gtccctgcac 11400 
gccaaggccg cgccggcttt gcgggtccgc gtggtcccgg ctggtgacga tgccaagtcc 11460 
ctgctggttt gtgatggcac cggtcgaccg gtgatctcgg tggaccgact cgtattgcgg 11520 
tcggctgcgg cccggcggac cggtgcgcgc cgacaggccc atcaagctcg gttgtaccgg 11580 
ttgagctggc caacggttca actgccgaca tccgctcagc caccgtcctg cgtgcttctc 1164 0 
ggcacctcag aagtgtccgc tgacatacag gtgtatccgg acctccggtc gttgacggct 11700 
gcgttggatg ccggtgccga accacccggc gtcgtcatcg cacccacgcc ccccggcggt 11760 
ggacgaacag cggatgtccg ggagacgact cggcatgcac tcgacctggt acaaggctgg 11820 
ctttccgatc agcgactcaa cgaatcccga ttgctcctgg tgacacaggg agcagtggcc 11880 
gtggagccgg gcgaacccgt gaccgatctg gcgcaggccg cgctctgggg actgctgcgg 11940 
tcgacgcaga ccgaacaccc tgatcgcttc gtcctcgtcg atgtgcctga gcccgcgcaa 12000 
ctcctccccg cgctgccggg ggtgctggcc tgcggcgaac ctcagctcgc gttgcgacgt 12060 
ggcggcgctc atgcgcccag actggctgga ctgggcagcg atgacgtcct gcccgtgccg 12120 
gacggcaccg ggtggcgatt ggaggccacg cgcccgggaa gcctggatgg gttggcattg 12180 
gtggacgaac cgacggccac ggcaccgctg ggtgacggtg aggtcaggat tgcgatgcgc 12240 
gcggccgggg tgaacttccg ggatgcgctc atcgcgctcg gtatgtatcc cggtgtggca 12300 
tcgctgggca gtgagggcgc cggggtcgtg gtggagaccg gccccggcgt caccggcctg 12360 
gcacccggcg accgcgtgat gggaatgatc ccgaaggcgt tcgggccgct cgcggtcgcc 1242 0 
gaccatcgca tggtgacgag gattcccgct ggttggagct tcgcgcgggc cgcatcggtg 12480 
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ccgatcgtct ttctcaccgc ctactacgcg 
gagtcgttgc tggttcattc ggccgccggt 
aggcacctcg gtgcagaggt gtacgccacc 
ctgagccgag aacacctcgc ttcgtcgcgg 
gcaaccggcg gacgcggcgt cgacgtcgtg 
gcgtctctgc gaatgctgcc gcgcggtggc 
cgtgaccccg tcgaggtcgc cgatgcgcat 
gtagaggcag gcccgcagcg aatcggcgag 
ggacgcgtgc tggagcccct gcctgtcacg 
ctacggcacc tgagccaagc gcggcatgtg 
tgggacgccg caggcacggt tctggttacc 
gcccggcacc tcgtgatcga gcgcggggtg 
cccgcagcca gtggcgctgc tgagctcgtg 
tccttgcagg cttgcgatgt cgccgatcgt 
ccggacgagc atccgttgac cgccgtggtg 
tccgaatcgc tcaccgtgga gcggctggac 
cggaatctgc tcgagctgat cgacccggac 
ggtgtgctcg gcagcggtgg gcagggtaac 
ttggcgcagc aaaggcagtc gcgcggccta 
gcggaacatg gcatggccag caccttgcgc 
gggttgctgc cgatctcgac cgaggagggg 
gcgcataccg tggtggcgcc ggttcgattc 
ttctccgtcc tgcaaggttt ggtcgggccg 
gatgccgaga gcctccggaa acggttggga 
ctgctggacc tcgtccgcat gcatgtggcg 
atcaccgcgg acggcacgtt caaggtgctg 
cgcaaccgga tcaacggggc gacggggctg 
ccgacgccgg atgcgctcgc cgcgcacctc 
gggacattcg aggaactcga caggtgggcg 
gccacgcggg cgcagatcac cacccggcta 
tccggcggaa ccggcggcgg ctccgtgccg 
cttttccaac tcctcgacaa cgatctcgaa 
aacttcctgg agggaaacgc cacatgtcga 
gtgcgctcgt ggatctgcac caggcgcgcg 
gggaacccat cgcgatcgtg .gcgatgggct 
aagggctgtg gaaactggtc gcctccggtg 
gtggttggca cctcgacgag ctctacgatc 
cccggcacgg cggcttcctc cacgacgccg 
gcccccgtga ggcgctcgcg atggacccgc 
agaccgtcga atccgctggg atggacccga 
tcgcgggatt gatgtacgag ggctatgaca 
aaggctatct cggaaccggc aatgcgggaa 
tcgggttcga gggcccagcg gtgacggtag 
tgcatttggc gtgtcagtcg ttgcggcagg 
tgacggtgat gtcgacgccg gagaggttcg 
cggatgggcg gtgtaagtcg ttcgcggcgg 
ccggtttggt gttgctggag cggctgtcag 
cggttgttcg tggtagcgcg gtgaatcagg 
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ctggttgatc tcgccgggtt gagaccaggg 1254 0 
ggggtgggga tggccgcgat. ccaactcgcc 12600 
gctagcgagg acaagtggca agccgtggag 12660 
acgtgcgatt tcgagcagca gttcctcggg 12720 
ctcaactccc tcgccgggga gttcgccgat 12780 
cgtttcctgg agttggggaa gacggatgtt 12840 
ccgggcgtgt cttaccaggc tttcgatacc 12900 
atgcttcacg agctggtgga gttgttcgag 12960 
gcttgggacg ttcggcaggc gcccgaggcg 13020 
ggaaagctgg tgctcaccat gcctccggtg 13080 
ggcggaacgg gagcacttgg cgcagaggtc 13140 
cgaaacctgg tcctcgtcag caggcgcggt 132 00 
gcgcaactga cggcctacgg tgccgaggtt 13260 
gagaccttgg cgaaggtgct tgccagcatc 13320 
cacgcggctg gtgttctcga cgacggagtg 13380 
caggttctgc gcccgaaggt cgatggcgcg 13440 
gtggccctcg tgttgttctc gtcggtgtcg 13500 
tacgcggcgg ccaactcctt cctcgacgca 13560 
ccgacgagat cattggcctg ggggccctgg 13620 
gaagccgagc aggatcgatt ggcgcgatct 13680 
ttgtcccagt . tcgacgccgc gtgcggcggc 13740 
agccgcttgt ccgacgggaa cgcgatcaag 13800 
catcgcgtca acaaagcggc gactgcggat 13860 
cgcttgccgg atgcagaaca acatcggatt 13 920 
gcagtgctcg gattcgccgg ttctcaggag 13980 
ggcttcgact cgttgaccgt ggtcgagttg 14 04 0 
cgactgcccg ccaccctggt gttcaactac 14100 
gtcaccgcgc tgtccgcaga ccgcctggcc 14160 
gcgaacctgc ccacgctggc cagggatgag 14220 
caggcgatct tgcagagcct ggcggacgtg 14280 
gaccggctca gatcggccac ggacgacgag 14340 
cttccctgat gcctcagccg gagccttcgc 14400 
atgaagagaa gctccgggag tacttgcggc 14460 
agcggctgca cgaggcggag tcgggagagc 14 520 
gccggtaccc gggtggggtg caggacccgg 145 80 
gcgacgccat cggtgaattc cccgctgatc 14640 
ccgacccgga tcagcccgga acctgctaca 14700 
gcgagttcga cgcgggattc ttcgacatca 14760 
agcagcggct gctgctggaa atctcctggg 14820 
ggtccttgcg ggggagccgc accggggtgt 14880 
ccggcgccca ccgggcagga gaaggtgtcg 14940 
gcgtcgcctc tggtcgggtt gcgtatgcgt 15000 
acacggcgtg ctcgtcgtcg ttggtggcgc 15060 
gcgagtgtga tctggcgctg gccggtggag 15120 
tggagttctc ccgtcagcgt ggtctcgcac 15180 
ctgcggatgg aaccggttgg ggtgagggtg 15240 
acgccaggcg gaacgggcat cgggtactgg 15300 
acggtgcgtc gaacggattg acggccccga 15360 
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acgggctggc ccaggagcgg gtcattcagc aggtgctcac gagtgcgggg ctgtcggcgt 15420 
ccgatgtgga cgctgtggag gcgcatggaa cgggtacgcg gcttggtgat ccgatcgagg 15480 
cgcaggctct gatagccgcc tatggacagg atcgggaccg ggaccggccg ctgtggttgg 15540 
ggtcggtcaa gtccaacatc ggtcatacgc aggcggctgc gggcgtcgct ggtgtgatca 15600 
agatggtcat ggcgatgcgg cacggggagc tgccgcgcac gttgcacgtg gacgagccga 15660 
attcgcacgt ggactggtcg gctggtgcgg tccgactcct gaccgagaac atccgctggc 15720 
cagggacggg tacgcgccgc gctggagtgt cgtcgttcgg ggtaagcggt accaacgcac 157 80 
acgtcatcct cgaacacgac ccgctcgccg tgaccgagaa cgaggaagca gcgcagtccc 15840 
cagcacctgg gatcgtgccc tgggcgttgt ccgggcggtc gtcgacggcg ctgcgggccc 15900 
aggccgaacg gctgcgcgag ctgtgcgagc agaccgatcc cgaccccgtc gatgtcggtt 15960 
tctcactggc cgccacgcgc acggcttggg agcaccgagc ggtggtgctt ggtcgggaca 16020 
gcgctacgtt gcgctccggg cttggcgttg ttgccagcgg tgaaccagcg gtcgatgtcg 16080 
ttgaggggag cgtcctggac ggcgaggtcg tcttcgtctt ccccggtcag ggctggcagt 1614 0 
gggccggtat ggcagtcgac ctgctggacg cttcgccgac gttcgcgcgc cacatggacg 16200 
agtgcgccac cgcgctgcgg aggtacgtgg actggtcgtt ggtcgacgtg ctgcgcggag 16260 
cggagaactc cccaccgctg gaccgggtgg acgtgctcca gcccgcgtcc ttcgcggtga 16320 
tggtgtcgct cgccgaggtg tggcgttcct acggggtgag gccggcggcc gtcgtcggcc 16380 
acagtcaagg cgaaatcgcc gcggcctgcg cagccggggt gctgccgctg gaggatgcgg 1644 0 
ccaggcttgt cgcattgcgc agcagagcgt tgaagggact ttcggggcgg. ggtggcatgg 16500 
cgtcgctggc ctgccctgcg gatgaggtcg cggcattgtt cgcgggatcg ggcggccgtc 16560 
tggaagttgc ggcgatcaac ggcccgcgat cggtcgtggt gtccggcgat ctggaagcgg 16620 
tggacgaact gctggcagag tgcgctgaaa aggacatgcg tgcacgccgt atccccgtcg 16680 
actacgcctc gcattcagcg cacgtggagg tggttcggag cccggtgctg gcggccgccg 1674 0 
ccggggtgcg acaccgggac ggccaggtgc cgtggtggtc gacggtgatc ggcgactggg 16800 
tggatccggc caggctggac ggcgagtatt ggtatcggaa cctccggcag ccggtccggt 16860 
tcgaacacgc cgtgcagggc ctggtcgagc ggggattcgg cctgttcatc gaaatgagtg 16920 
cgcatccggt gctgaccacg gcggtcgagg aaaccggtgc ggagtcggag accgccgtgg 16980 
ccgcggtagg taccttgcga cgtgactcgg gcggcctccg gaggttgttg cattcgctgg 17040 
ccgaggcgta cgtgcgcggc gccaccgtgg actgggccgt ggcgttcggg ggcgcgggcc 17100 
gacggctgga cctgccgacc tacccgttcc agcgccagcg gtactggctg gacaagggag 17160 
ctgcctccga cgaggctcgt gcggtctcgg acccggcggc gggctggttc tggcaagccg 1722 0 
tggcgcgcca agacctgaaa agcgtgtccg atgccctcga tctcgacgcc gacgcaccgc 17280 
tgagcgcaac acttccagcc ctgtccgtct ggcaccgtca ggaacgagaa agggtcttgg 1734 0 
cagacggttg .gcggtaccga gtcgactggg tacgggtggc cccgcagccg gtccggagaa 17400 
cgcgggaaac ctggctcctg gtcgttcccc cgggcggcat cgaggaagcg ctggtcgaac 17460 
ggctgacgga tgcgttgaac acgcgaggga tcagcaccct gcgcctcgac gtgccaccgg 17520 
cggcgaccag tggcgaactc gcaaccgaac tccgcgccgc agccgacggt gacccggtga 17580 
aggcaatcct gtcgctcacc gcgttggacg agcgacccca ccccgaatgc aaggacgtcc- 17640 
cgagcgggat tgccttgctg ctgaacctgg tcaaggcgct cggtgaagcc gacctcagaa 17700 
ttcctctgtg gaccatcacg cgtggtgcgg tcaaggcagg ccccgcagat cggctgctgc 17760 
gcccgatgca ggcgcaagca tggggtctgg ggcgagtagc cgcactcgaa caccccgagc 17820 
gctggggtgg gctgatcgac ctgccggatt cgctggacgg cgacgtcctc acgaggctgg 17 880 
gcgaagcgct caccaacggc ttggcggaag accaactggc gattcgccag tcgggcgtgc 17 94 0 
tggcccggcg actggtaccc gccccggcga atcagcccgc tggacgtaag tggcgccccc 18000 
gagggagcgc gctgatcacg ggcggactcg gcgcggtggg cgcacaggtg gcgaggtggt 18060 
tggccgaaat cggagccgag cgaatcgtgc tcaccagtcg acggggcaac caagcagcag 18120 
gcgccgccga gctggaagcc gaactccggg cccttggagc gcaagtgtcc atcgtggctt 18180 
gcgacgtgac cgatcgtgcc gagatgtccg cactactggc cgagttcgac gtcaccgcgg 18240 
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tgttccacgc ggccggagtc ggtcggctgc 
tggccgaaat atgcgcggcg aaggtccgcg 
gcaccgatct cgatgccttc gtcctgttct 
gtcagggcgc ttacggcgcg gcgaacgcat 
cacgcggtct - gccggcaacc tcgatctcct 
acggcgcggc gggcgaacac ctgcggcgac 
ccatcctggc tctgcaggaa gtacttgacc 
tggactggga ccgattcgtt cccacgttcg 
aagtgccggc ggcgagaaag gcgatgcccg 
cgttcgcccg caatctcgcg gagctgccgg 
tggtgtgcgc ccaggtggca accgtgctcg 
agcgggcgtt ccgcgcgctc gggttcgact 
tgaccaccgc caccgggttg cgcctgccga 
ccgccttggc cgctcacctg ctcgaggagc 
ccgctgccag cgcgcccgcg agtgacgaac 
ttccgggtgg cgcgcactcg ccggaagacc 
tgatcggcga gttcccctcc gaccggggct 
cttccaggcc tggaacgacg tatgcgcgga 
tcgatgccga cctgttcggc atcagcccac 
ggttggtgct cgaaatcgcc tgggaagccc 
tgaagggcag tggggtcggc acgtacatcg- 
tgcggcagtt tcccgaggag gcggagggct 
tgtcgggtcg ggtcgcgtat tcgtttggtt 
cttgttcgtc gtcgttggtg gcgttgcatc 
gtgatctggc gttggccggt ggtgtgaccg 
tctcccgtca gcgcggtttg gcgccggatg 
acggcaccgg ctggggcgaa ggcgcgggcc 
accggaatgg gcatcgggtg ttggcggtgg 
cctcgaacgg actggcggcg ccgaacggtc 
tcgcgaatgc ggctctttcg gcgtccgatg 
ccaggctggg tgatccgatc gaggcgcagg 
agcgggatcg gcccttgtgg ctggggtcgg 
cggcgggtgt tgccggtgtg atcaagatgg 
cctcgctgca cgcggatgag cccacgtcgg 
tcctcgccga acaggtacct tggccggagt 
cgttcgggat cagcggcacc aacgcacatg 
atagtacagc ggagacggac aaaacagaat 
cctggttggt gtcgggaaag acgacggatt 
ctcaggtcga gtcccggccg gagcagcgtt 
gccgagccgc gctggatgaa cgcgctgtcg 
ctggactggc ggcgttggcc gccggtcagg 
cttctgctcg gttcgggttc gtgttctcgg 
gagcgctcta ctcgaagttt ccggtgttcg 
tggaggcaca tctgggggaa gaccgccggg 
agctgctgga tcagacgctg tgggcgcagt 
tggggctgct gggttcgtgg ggcgttcggc 
agttggccgc cgcgtttgcg gctggcgtgt 
ccgcgcgcgc ccggttgatg caagccctgc 



tgccgttggc ggagaccgac cagaacggcc 18300 
gcgctcaggt gctggacgaa ctgtgcgaca 18360 
cctcgggtgc cggggtatgg ggcgggggcg 18420 
tcttggacac actcgccgaa caacgccgag 1848 0 
ggggcagttg ggccggcggc ggcatggccg 1854 0 
gcgggatacg tccgatgccg gcggcgtcgg 18600 
aggatgagac gtgcgtgtcg atcgctgatg 18660 
ccgcgactcg cgccacccgg ttgttcgacg 1872 0 
cgaatgggcc ggcagaacca ggcggctcgc 18780 
aagcccaacg acgccacgaa ctggtggatc 18840 
ggcacggcag tcgcgaggaa gtccagcccg 18900 
ccctcatggc ggtggatctg cgcaatcgtt 18960 
ccacaaccgt cttcgactac ccgaatccgg 19020 
tggtgggtga tgtcgcgtcg gctgcggtga 19080 
cgatcgcgat cgtcgcgatg agctgccggt 19140 
tgtggcggct ggtcgccgcc ggcacggagg 19200 
gggatgcgga aggcctttac gatccggatg 19260 
tggcgggatt cctctacgac gccggtgagt 19320 
gtgaggcgtt ggcgatggat ccgcagcagc 19380 
tcgaacgggc cggaatcgat ccgttgtcct 19440 
gcgctggaag ccgtgggtac gcgacggatg 19500 
acctgctgac gggtacctcg gccagtgtgc 19560 
tcgagggtcc tgcggtgacg gtggatacgg 19620 
tggcgtgcca gtcgttgcgt tcgggcgagt 19680 
tgatgtcgac gccggagatg ttcgtggagt 19740 
ggcggtgcaa gtcgttcgcg gagagcgcgg 19800 
tgttgttgct ggagcggttg tcggacgccc 19860 
ttcgtgggtc agcggtgaat caggacggcg 19920 
cgtcgcagca gcgggtgatc aaccaggcac 19980 
tggatgcggt ggaggcacat ggcaccggga 20040 
cattgatcgc aacgtatggg caggcccggg 20100 
tcaagtcgaa catcggtcat acgcaggccg 2 0160 
tgatggccat gcggcacggg cagctgcccg 2 0220 
aggtcgattg gtcgtcgggg gcggtccggc 20280 
ctgaccgtgt tcgtcgggtg ggggtttcgt 20340 
tgatcctcga acaagctacg aatgcgccag 2 0400 
ccggatctac tgtcgatatt ccggtcgttc 20460 
ccctgcgggg ,acaagccgaa cgagtcttgt 20520 
cgctggatgt tgcctactcg ctfcgcttctg 20580 
tgctgggtgc ggaccgcggt gagctggttg 2064 0 
aggcttctgg ggtgatcagc ggaactcgtg 20700 
ggcagggtgg tcagtggttg gggatgggca 20760 
ctgctgcgtt tgatgaggct tgcgccgagt 20820 
ttcgggatgt ggtcttcggt tccgatgcgc 20880 
cgggtctgtt cgcgctgcaa gccggcctct 20940 
cggatgtggt gatggggcat tcggtcgggg 21000 
tgtcgttgcg ggatgcggct cggttggtgg 21060 
cctctgacgg cgcgatgttg gcggtggctg 21120 
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ctggtgaaga ccttgttcgg ccattgctgg ccggtcggga ggagtccgtg agcgtcgccg 21180 
cgctcaatgc ccccggttcg gtggtgttgt cgggcgatcg ggaggtgctg gccagcatcg 21240 
tcggccggct gaccgagctc cgagtccgga cgcggcgctt gcgggtctcc catgcttttc 21300 
attcgcaccg gatggacccg atgttgggcg agttcgccca gatcgccgag tctgcggagt 21360 
tcggtaagcc aacgacaccg cttgtgtcga cgttgacggg tgagctcgac agagccgcgg 21420 
aaatgagcac accagggtat tgggtgcgcc aggcgcgtga acccgtccgt ttcgccgacg 214 80 
gtgtccaggc cctggcagcg cagggcatag gcacggtcgt cgagctcggc ccggacggaa 21540 
cgctggcggc actggttcgg gagtgtgcga ccgagtccga tcgggttggg cggatttcgt 21600 
cgatcccact gatgcgcagg gagcgggacg agacccgttc ggtgatgaca gccctggcgc 21660 
atctccacac ccgtggtggt gaggtggact ggcaggcgtt tttcgccggt accggcgcta 21720 
ggcagctcga gttgccaacg tatgccttcc aacgacagca ctactggatc gagtccagtg 21780 
cgcggccagc acgcgaccgc gcagacatcg gcgaggtggc ggaacagttc tggaccgcgg 21840 
ttgaccaagg cgatctggca acgttggtcg ccgctctgga tcttggggcg gacgacgaca 21900 
catgcgcatc gttgagcgat gtattgccgg cgttgtcctc ctggcgaagc ggactccgca 21960 
accgttcgct cgtcgattcc tgccggtacc gaatcagttg gcattcctct cgggaggtgc 22020 
cggccccgaa gatttccggt acctggctgt tggtcgtgcc cggtgctgcg gatgacggat 22080 
tggtcacggc tttgacgagt tcactggtcg gaggcggcgc cgaggtcgtc cggatcggcc 22140 
tgtccgaaga ggacccgcac cgcgaggacg tcgcacagcg gctggccaat gcgctgacgg 22200 
atgccggtca actcggtggc gtgctttcgc tgttggggct cgatgaatcg cctgctccgg 22260 
gattctcctg cttgccaact ggtttcgcgc tgactgtgca gcttctgcgg gccttgcgga 22320 
aggccgacgt cgaggcgcct ttttgggcgg tgacgcgcgg cggcgtcgcg ttggaagatg 223 80 
tacgcgtgtc tccggagcag gccctggtct gggggctgct gcgtgtcgcg ggactggagc 22440 
acccggagtt ctggggtggc ttgatcgacc tgccatcgga ctgggacgac cgattgggtg 22500 
cccggttggc gggtgtgttg gcggatggtg gcgaggatca agtcgccatt cgccgtggtg 22560 
gtgtgttcgt gcggcggttg gaacgcgctg gtgcgtcggg tgccgggtcg gtgtggcgtc 22 620 
ctcgggggac ggtgttggtg acgggtggta cgggcggttt gggggcgcat gttgcccggt 22680 
ggttggccgg tgccggggct gagcacgtgg tgttgaccag ccgtcgagga gcggacgctc 22740 
cgggcgctgg ggaattgcgg gcggagctgg aggcgctggg tgctcgggtg tcgattgtgc 22800 
cctgcgacgt ggctgatcgt gacgcagtgg ctggagtgtt ggcagggatc ggtggggagt 22 860 
gtccgctgac tgcggtggta cacgccgccg gggtcggcga ggcgggcgac gtagtggaga 22920 
tgggtttggc ggattttgca gcggtgttgt cggcgaaggt gcgtggtgcg gcgaatctgg 22980 
acgagttgct ggccgactcg. gagctggatg cgtttgtgat gttctcctcg gtgtcggggg 23040 
tgtggggagc cggcggacag ggtgcgtatg cggctgcgaa cgcctacttg gatgcgttgg 23100 
ccgagcagcg tcgggcgagg ggattggtcg ggaccgcggt tgcgtgggga ccgtgggccg 23160 
gtgacggcat ggccgccggc gaaaccggcg cacagctgca ccggatgggc ctggcgtcga 23220 
tggaaccgag cgcggcgctg ctggcacttc agggtgcatt ggaccgcgat gagacctccc 23280 
tcgtcgtggc cgatgtcgat tgggcacggt tcgccccagc cttcacctcg gcacgtcgac 23340 
gcccgctgct ggacaccatc gacgaggccc gagccgcatt ggaaaccacc ggcgaacaag 23400 
cgggcacagg caaacccgtt gagctgacgc aacgcctggc cggactgtcg cggaaggaac 23460 
gcgacgatgc ggtattggat ctggtgcggg cggagacggc ggctgtgctg ggacgcgacg 23520 
atgccacggc cctggcgcca tcgcggccgt tccaggaact cggattcgac tccttgatgg 23580 
cggtggagct gcgcaaccgg ctgaacaccg ccaccgggat ccagctgccc gccagcacga 23640 
ttttcgacta ccccaatgcc gagtcgctgt cgcgtcacct ctgcgccgag cttttcccaa 23700 
cggagactac cgtggactcg gcccttgccg agctcgatcg aatcgagcag cagctctcga 23760 
tgctcaccgg cgaagcgcgg gcacgggacc gaatcgcgac acgactgcga gccctccacg 2382 0 
agaagtggaa cagcgcagct gaagtaccga ccggagccga tgtcctgagc acgctcgatt 23 880 
cggcgacgca cgacgagata ttcgagttca tcgacaacga gctcgacctg tcctgagcag 23940 
ttcctgcgga acttcaagcg ccgaaatcgg gtggaaatca. caatggccaa tgaagaaaag 24000 
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ctcttcggct atctgaagaa ggtaactgcg 
gcggccgaga gccggagtca ggagccgatc 
ggcggcgtcg actctcccga agcgctctgg 
tcggagttcc ccgccgaccg gggctgggat 
caccagggaa cgtcgtacac gcgggccggc 
cccgccatgt tcgggatttc gccgcgtgag 
ttgctggagc tgtcctggga ggccctcgaa 
ggcagcaaga ccggtgtctt cggtggtgtc 
gagatgagcc gaaacgctgg gggttttgga 
ggtcgggttg cgtattcgtt tggttttgag 
tcgtcgtcgt tggtggccct gcatttggcg 
ctcgcgctgg ccggcggtgt gacggtgatg 
cgtcagcgtg gtttggctcc ggacgggcgg 
accgggtggg gtgagggtgc cggtctggtg 
aatgggcacg aggttctggc ggtggtgcgg 
aatggtttga ctgcgccgaa tggtccgtcg 
agtgcggggc tgtcggtttc cgatgtggat 
ttgggtgatc cgatcgaggc acaggccctg 
gatcggccgt tgtggttggg gtcggtcaag 
ggcgttgccg gcgtcatcaa gatggtcttg 
ttgcatgtgg atgagcccac gtcggcggtg 
acggagaaca cgccctggcc ggacagtggt 
gggatcagcg gcaccaacgc acatgtgatt 
ccggccgggc cggtcgaagg cgagcgggaa 
gtgctgtcgg gtaagacacc ggaggctgcg 
atcgaggacc ggccggggct gtcgccggtg 
gcggcgctgg atgaacgcgc agtggtgttg 
ttgagggcat tcgccgacgg ctgcgatgcg 
ggtggccgcg tcgggttcgt gttctcgggt 
gggctctact cggtgtttcc ggtgttcgcc 
gatgcacacc tgggccagga actgcgggtt 
ttgctggatc ggacggtgtg ggcgcagtcg 
cggctgctgg gttcgtgggg tgttcggccg 
ctggctgcgg tgcatgcggc tggtgtgttg 
ggtcgcgccc ggttgatgca ggcgttgcct 
ggtgagtttc aggtcgatcc tctgctggat 
gtgaatggcc cggaatcggt tgtgctctct 
gatcggttgc acgatcaggg gtgccggacc 
tcgccccata tggagccgat gctggaggag 
cacgcaccgg aactgccgat catctcgacc 
atgggcactc ccgagtactg ggtgcgtcag 
gtccaggcgc ttgtcggtca gggtgtcggc 
ttgtcgacgt tggtcgagga gtgtgtggcg 
atgcgcaagg accgcgacga ggcgcgaacc 
cgtggtggtg aggtggactg gcggtcgttt 
ctgcccacct acgccttcca gcggcagcgg 
gacgtgaccg ccgccggatt ggccgaggcg 
ttggc.agacg gcgaaggtgt ggtgctgacc 



gacctgcatc agacccggca gcgcctgctc 24060 

gcgatcgtct cggcgagctg ccgactgccc 24120 

caactcgtgc gcactggcac cgacgccatc 24180 

ctcggccggt tgtacgatcc cgacccgaac 2424 0 

ggtttcctcg caggagcggg cgatttcgac 24300 

gcgttggcga tggacccgca gcaacggttg 24360 

cgggcgggca tagacccgac atccctgcgc 24420 

acgccccagg agtacgggcc gtccttgcag 24480 

ctcaccgggc ggatggtgag tgtggcgtcg 2454 0 

ggtcctgcgg tgacggtgga tacggcgtgt 24600 

tgtcagtcgt tgcgttccgg cgaatgcgat 24660 

gcgacaccgg cgacgttcgt ggagttctcc 2472 0 

tgcaagtcgt tcgcggctgc cgcggatggc 24780 

ttgctggagc ggttgtcgga tgcgcggcgg 24840 

ggtagcgcgg tgaaccagga cggcgcgtcg 24900 

cagcagcggg tgatcaccca ggcgttggcg 24960 

gcggtcgagg cacatgggac cgggaccacg 25020 

atcgccacgt acgggcaggg ccgggagaag 25080 

tccaacatcg gtcacacgca ggcggccgct 25140 

gcgatgcggc acgggcagct gcccgccacg 25200 

gactggtcgg cgggttcggt ccggcttctc 25260 

cgtccttgcc gggtgggggt gtcgtcgttc 25320 

ctcgaacagt ctccagtcga gcagggcgaa 253 80 

ccggatgtag ccgtccccgt ggtgccttgg 25440 

cgggcgcagg ccgaacgggt gcattcgcat 25500 

gatgtggcgt attcgctagg aatgacacgc 25560 

ggctcggacc gtgccgcgct cctgaccggg 25620 

cccgaagtgg tttcggggtc tgtggggctt 25680 

cagggtggtc agtggccggg gatgggccgg 25740 

gacgcgttcg acgaggcttg cgcggagttg 25800 

cgggatgtgg tgttcggttc gcaagcgtgg 25860 

ggtttgttcg cgttgcagat tggcttgctg 25920 

gatgtggtgt tggggcactc ggtgggtgag 259 BO 

tcgttgtcgg aggccgcgcg gttggtggcg 26040 

tctggtggtg ccatgctcgc ggtcgctacg 26100 

ggggtgcggg accggatcgg tatcgcggcg 26160 

ggtgaccgcg agctgctcac cgagatcgct 26220 

cggtggttgc gggtgtcgca tgctttccat 262 80 

ttcgcccaga tctcccgagg ccgcgaatat 26340 

ctgatcggtg agctggacgg tggtcgagtg 264 00 

gtgcgtgagc ccgtccgttt cgccgagggt 26460 

acgattgtcg aattgggtcc ggacggggcg 26520 

gaatccgggc gggtggccgg gatcccgctg 26580 

gtgctggcag ctttggcgca gatccacacc 26640 

ttcgccggta ccggggcgaa gcaagtcgac 26700 

tactggctgg catccaccgg gcgtgcgggt 26760 

gaccatccgc tgctcggtgc ggtggttgcg 26820 

ggtcggttga cagcgggttc gcatccgtgg 26880 
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ttgtccgatc accgggtgct gggcgaaatc gtcgtccccg gcaccgcgat cgtcgagctg 26940 
gtgtggcacg tcggcgagcg cctcggttgt ggccgggtgg aagaactggc tttggaagcg 27000 
cccctgatcc tgccggatca tggagcggtc caggttcagg tgctggtggg accgcccggg 27 060 
gaatccggag cccggtcggt ggcgctctac tcctgtcctg gcgaggcgat cgaacccgag 27120 
tggaagaagc acgcgacggg cgtgcttctc ccacccgtgg ccgccgagaa ccatgagctg 27180 
accgcatggc ccccggagaa tgcgaccgaa atcgatgcag acggggtcta cgcattcctt 27240 
gaagggcacg gtttcgcgta cggaccggcc tttagatgtc tgcgcggtgc ctggcgacga 27300 
ggcggggagg tgttcgccga agtcgcattg ccggatgaca tgcaggcggg ggtcgatcga 27360 
ttcggcgtcc accccgcgtt gctggacgcg gttctgcatg ccgccgcagc cgagacgtcg 27420 
gtggtccaga gcgaagcgcg ggtgccgttc tcgtggcgtg gggtggaact tcgcgccact 27480 
gaaagcgcgg tggtgcgggc gcgcctctcg ttgacttcgg atgacgaact gtcgttggtc 27540 
gcagtggacc cggctggccg attcgtggcc acggttgatt cgctggtgac ccgaccgatc 27600 
tcccggcagc aggtgaggtc tggcgcgatc ggtgattgcc tgttcgaggt ggagtggcac 27660 
cggaaggcgt tgttgggaac aaccgccggc gacgaccttg ccatcgtcgg tgacggtccc 27720 
agttggccgg aatcggtgcg cgcaaccgca cggttcgcga ccctggatga gttccgtgcg 27780 
gccgtggact cggacgttcc tgccccgggt tcggtgttgg tcgcagctat gtcggccgaa 27840 
gaggtcgagg gtggatccct gccgtcgcgc gcccaagagt cgacctccga tctgctggct 27900 
ctcgtgcagt cgtggcttgc ggacgagcgg ttcgccgaat cccagctcgt ggtcgtcacg 27960 
cgtgcagcgg tgtcggccga ctcggattcg gacgtcgcgg acctggtggg tgcgtcgtcg 28020 
tgggggttgt tgagttcagc ccagtcggag aacccgggtc gcttcgtgct ggtggacgtg 28080 
gacggcacac ctgagtcgtg gcaggcgttg ccggccgccg tgcgagcagg agaaccgcag 28140 
ctggcacttc ggcgcggcgt ggcgctggtg cctcggttgg cgcgactcac ggtgcgcgag 28200 
gagggctcct ccccgcaact cgacacggac gggaccgtcc tcatcacggg tggcaccggt 28260 
gcgttggggg gagtggttgc ccgtcacctg gtggaggagc acgggattcg gcgtttggtg 28320 
ttggcaggcc ggcgtggctg gaatgcgcct ggagtccacg agttggtgga tgagctggcg 28380 
cgcgcgggcg ccgtggttga -ggtggtggct tgcgatgtgg ctgaccgcac cgatctggag 28440 
cacgtgctgg ccgccattcc ggtcgactgg ccgctgcggg ggatcgtgca taccgctggg 28500 
gtgctggccg acggagtgat cgggtccttg tcggcggcgg atgtgggcac ggtgtttgcc 28560 
ccgaaggtga cgggggcatg gcatctgcac gagttgaccc gcgatctgga tctgtcgttc 28620 
ttcgttcttt tctcttcctt ctccgggatt gcgggtgccg cagggcaggc caactacgcg 28680 
gcggcgaaca cgttcctgga tgcattggcg cgttatcgcc gggcgcgtgg gctgcctggg 28740 
ttgtcgttgg cgtggggact gtgggcgcaa cccagcggta tgacgagtgg cttggacgcg 28800 
gcgtcggtgg agcggttggc gcggacgggc atcgcagaac tttccacgga ggatggactc 2 8860 
cgcctgttcg atgccgcgtt cgcgaaggac cgggcttgcg tcgttgccgc tcgattggac 28920 
agggcgctgc tggtcgggaa cggacgatcg cacgcgattc cggcgctgtt gagcgcgttg 28980 
gttcctgttc gcggcggtgt ggcgaggaaa acagccaatt ctcaggccgc ggatgaggac 29040 
gcactgttgg gtttggtgcg ggagcacgtt tcggccgtgc tgggttattc gggtgcggtc 29100 
gaggttgggg gcgaccgtgc tttccgtgat ctgggttttg attcgttgtc tggcgtggag 29160 
ttgcggaacc gccttgccgg ggtgctgggg gtgcggttgc cggcgactgc ggtgttcgac 29220 
tatccgacgc cgcgggcgct ggcgcgtttc ctgcatcagg aactggcagg cgaggtcgcg 29280 
tccacgtcga cgccggtgac cagggcagcg agtgccgaag aggatcttgt tgcgattgtc 29340 
gggatgggat gtcgttttcc gggtggggtg tcgtcgccgg aggagctttg gcggctggtg 2 94 00 
gccggcggcg tggatgcggt ggctgggttc ccagacgatc gcggctggga tctcgcggcg 2 94 60 
ttgtacgatc ctgatcccga tcgtctcggg acctcgtatg tgtgtgaggg cgggtttctg 29520 
cgggacgcgg cggagttcga tgctgacatg ttcggcatca gcccgcgtga ggcgttggcg 29580 
atggatccgc agcagcggtt gctgctggag gtcgcctggg aaaccttgga gcgggctggg 2 964 0 
atcgatccgt tctcgttgca cggcagccgg accggtgtgt tcgcgggctt gatgtaccac 297 00 
gactatgggg cccgattcat taccagagca ccggagggct tcgaagggca cctcgggacg 2 97 60 
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ggcaatgcgg ggagcgtgct gtcgggtcgg 
gcggtgacgg tggatacggc gtgttcgtcg 
gcactgcggg ccggtgagtg cgaattcgcc 
ccgacgacgt tcgtggagtt ctcccgtcaa 
tcgttcgcgg cggccgcgga tggcaccggg 
gagcggttgt cggatgcccg gcgcaatggg 
gcggtgaacc aggacggcgc gtcgaatggc 
agggtgatca cccaggcact cacgagtgcc 
gaggcgcatg ggacgggcac gcggcttggt 
acgtacggcc gggatcgtga tcccggtcgg 
attggtcaca cccaggcggc ggcgggtgtc 
cggcaggggg agctgccgcg cacgttgcac 
tctgcgggca cggtccaact cctcacggag 
cgccgggcgg gcgtgtcatc gttcgggatc 
caacctccgc gagagtcgca gcgctcaaca 
ccggtggtgc cgtggatggt gtcgggcaaa 
gcattgatgt cctacttgag caatcgcgtt 
cttgcggtga cccgtccggc gttggaccac 
gcgttgctgc cgggcttgaa agcgctggcc 
ggcactcgtg ccgctgggcc ggtcggattc 
gggatgggaa gcgggctcca ctcggcgttt 
tgctgcgagc tggatgcgca tctcgggcag 
tcggatacgc aacttctgga ccagaccttg 
gtcggactct gggagttgtt gggttcgtgg 
tcggtcggtg agctggcggc ggcgttcgcg 
cggctggtgg cgggccgtgc ccggttgatg 
gctgcggctg ctggagagga gcagctgcgc 
gggatcgccg cggtcaacgc tcccgggtcg 
gatgacattg ccggtcggct ggacgggcaa 
catgcgtttc attcgcatcg gatggatccg 
agcgtggact accggtcgtc agggctgccg 
gaggtcggca tgccggctac gccggagtat 
ttcgccgacg gtgttgctgc gctcgcggct 
ccggatgggg tgttgtcggc gctggtgcag 
cgggtggccg cggttccgct catgcgcagc 
gcattggcgc agatccatgt gcgtggtgct 
accggggcaa agcaggtcga gctgcccacg 
gactcaccat ccgaaccggt cgggcaatcc 
gaactcgtcg agcaggaaga tgtcagcgcg 
cacgacgtgc aggcgtccct ggaatcggtg 
atccgcaacg aatccctggt gcaccagtgg 
gatttgccag acccctcgtt gtcggggaca 
gcgagtcggc aagttctgcg tttcaacgag 
ctgttcgagc tcgccgggca cgacgaggaa 
gttgcgtcag ggggaataag cggcgtgttg 
tcgccgaacg ctgctttgcc gaatggcgcg 
cgggccgcgg atgtgtcggc gccattgtgg 
gatgtgccgg tgaacccggg gcaggcgctg 



gttgcgtatt cgtttggttt cgagggtcct 29820 
tcgttggtgg cgttacacct ggcgggtcaa 29880 
cttgccggtg gcgtcacggt gatgtcgacg 29940 
cggggtctgg ctccggatgg gcggtgcaag 30000 
tggggcgagg gtgccggtct ggtgttgctg 30060. 
cacgaggttc tggcggtggt gcggggtagc 30120 
ttgactgcgc caaatggtcc gtcacagcaa 30180 
gggctgtccg tgtccgacgt ggatgctgtg 30240 
gatccgatcg aggcgcaggc gttgatCgct 30300 
ccgttgtggc tggggtcggt gaagtcgaat 30360 
gctggtgtga tcaagatggt gatggcgatg 30420 
gtggacgagc cctccgcgca ggtggactgg 30480 
aacacgccct ggcccgacag cggtcgtctt 30540 
agtggcacca acgcgcacct gatccttgaa 30600 
gagccggatt cgggttctgt ccgcgatttt 30660 
acacccgaag cgctatccgc ccaggcagat 3072 0 
gatgcttccc cgcgagatat cggttattcg 30780 
cgcgctgtcg tgctgggtgc ggatcgtgcc 30840 
gttagtaatg acgctgccga ggtgatcacc 30900 
gtgttctccg gtcaaggtgg tcagtggccc 30960 
ccggtgttcg ccgacgcgtt tgacgaagcc 31020 
atggcccggc tacgagatgt gttgtccggt 31080 
tgggcgcagc cgggcctgtt cgcgttgcaa 3114 0 
ggtgtccggc ccgctgtggt gctgggccac 31200 
gctggagtgt tgtcgttgcg ggatgcggct 31260 
caagccctgc caactggcgg tgccatgctc 31320 
ccgttgctgg ccgactgcgg tgatcgtgtg 31380 
gtggtgctct ccggtgatcg ggatgtgctc 31440 
gggatccggt ccaggtggtt gcgggtttcg 31500 
atgctggcgg agttcaccga aatcgcccgg 31560 
atcgtgtcga cgttgacggg tgagctcgat 31620 
tgggtgcgcc aggtgcgaga acccgtccgc 31680 
cacggtgtga gcaccgtcgt cgaggtcggt 31740 
gagtgcgcgg ccggatccga tcagggcgga 31800 
aatcgcgacg aggcgcacac ggtgacaacg 31860 
gaggtggact ggcggtcgtt tttcgccggt 31920 
tatgccttcc aacgacagcg gtactggctt 31980 
gccgatcccg cgcgccagtc gggcttctgg 3204 0 
ctcagcgccg ctctgcacat taccggcgat 32100 
gttccggtcc tctcctcctg gcatcgccgg 32160 
cggtaccgga tttcctggca tgagcgggca 32220 
tggctcgtcg tcgtgccgga ■ ggggtggtcg 32280 
atgttcgagg aacggggttg cccggcagtt 32340 
gccctggcgc aacgattccg ctcgttgcct 32400 
tccttgctgg cgctggatga atcgccgtcc 32460 
ctgaactcgt tggtactgct gcgagctctg 32520 
ttggcgacgt gtggtggtgt cgcggtcggg 3258.0 
gtgtggggac tgggtcgcgt cgtcggtctg 32640 
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gagcatccgg cctggtgggg tggcctggtc gacgtgccgt gcttgctcga tgaggacgct 32700 
cgagaacgct tgtcggtcgt gttggcaggt cttggcgagg acgagatcgc ggtacgtccc 32760 
ggtggtgtgt tcgtgcggcg gttggaacgc gctggtgcgg cgtcgggtgc cgggtcggtg 32820 
tggcgtcctc gggggacggt gttggtgacg ggtggtacgg gcggtttggg ggcgcatgtt 32880 
gcccggtggt' tggcgggtgc cggggctgag catgtggtgt tgaccagccg tcgaggcgcg 32940 
gcggctccgg gcgctggaga tttgcgggcg gagctggagg cgctgggcgc tcgggtttcg 33000 
atcacggcct gcgacgtggc cgatcgtgac gctttggccg aagtgttggc gaccattccg 33060 
gatgattgcc cgctgaccgc ggtgatgcat gcggcggggg tcgttgaagt cggcgacgtg 33120 
gcgtcgatgt gtttgaccga cttcgttggg gtgctgtcgg cgaaggcagg tggtgcggcg 33180 
aatctcgatg agttgctcgc cgatgtcgag ctggatgcct tcgtgctgtt ctcatccgtc 33240 
tcgggtgtgt ggggtgctgg cgggcagggc gcttatgcgg cggcgaatgc ctacttggat 33300 
gcgttggcgc agcagcgtcg ggcaaggggg ttggtgggga ctgcggttgc gtggggcccg 33360 
tgggccggtg acggaatggc cgcaggtgaa ggcggtgcac agctgcgccg ggccggcctg 33420 
gtgccaatgg ctgcggatcg ggcgttgctg gcacttcagg gcgcattgga tcgtgacgag 33480 
acatccctgg tcgtggccga tatggcgtgg gagaggttcg ccccggtgtt cgccatgtcc 3 3540 
cgtcggcgtc cgctgctcga cgagctgccc gaagcacagc aggcgttggc ggatgcggag 33600 
aacaccactg atgctgcgga ctcggccgtc ccgctaccgc ggctcgcggg catggcagcc 33660 
gccgaacgcc gccgcgcgat gctggacctg gtgctggcgg aggcctcgat tgtgttggga 33720 
cacaacgggt ctgacccagt tggtcccgac cgggcgttcc aggagctcgg atttgattcg 33780 
ctgatggccg tcgaactgcg caacaggttg ggcgaggcaa caggattgag tctgccggcc 33840 
acgttgatct tcgattatcc gagcccatcc gcgctggctg agcagctggt cggcgagctg 33900 
gtgggagcgc agcccgcgac caccgtcgtg gccggggccg atccagtgga tgatccggtt 33960 
gtcgtggtcg cgatgggatg ccggtatccg ggcgacgtct gctcgcccga ggagctgtgg 34 020 
cagctggttt ctgcgggacg tgatgcggta tcgacgttcc ccgtcgatcg gggttgggac 34 080 
tgcaacacgt tgttcgaccc ggatccggat cgggcaggca gtacctatgt gcgagaaggt 34140 
gccttcctga ccggtgctga tcggttcgac gccgggttct tcggcatcag ccctcgcgag 34200 
gcgcgcgcaa tggatccgca gcagaggttg ttgctcgaag tggcgtggga ggttttcgaa 34260 
cgagcaggaa tcgctccgct gtcgttgcgg ggtagcagga ccggtgtgtt cgcggggacc 34320 
aatgggcagg accacggtgc gaaagtggct gccgcgccgg aggcggcggg tcacctcctg 34380 
accggaaacg ccgcgagtgt cctggccggc cggctttcct acacgttcgg ccttgagggg 34440 
cctgcggtgg cggtggatac cgcgtgttcg tcgtcgttgg tggcgttgca tttggcgtgc 34500 
cagtcgctgc gttcgggtga gtgtgatatg gcgttggcag gtggtgtgac ggtgatgtcg 34560 
acacccctgg ctttcctcga gttctctcgt cagcgcggtt tggcgccaga tggtcggtgc 34620 
aagtcgtttg cggccgctgc ggatggcacc gggtggggtg agggtgccgg cctggtgttg 34680 
ctggagcggt tgtcggatgc tcgtcggaat ggtcaccggg tgttggccgt ggttcgcggg 34740 
tctgcggtga atcaggatgg tgcgtcgaat ggcctgactg cgccgaatgg tccgtcgcag 34 800 
cagcgggtga ttcggcaggc cctcgcgaat gcggggctgt cggcgtccga tgtggatgtc 34860 
gtggaggcgc acgggaccgg taccgggctc ggggatccga tcgaggcgca ggcgctgatc 34 920 
gcgacatatg ggcaggagcg ggatcctgag cgggccctgt ggctggggtc gafcaagtcc 34 980 
aacatcggcc acacgcaggc ggcggccggt gtggcggggg tcatcaagat ggtgcaggcc 35040 
atgcggcacg gggagttgcc tgcgacgttg cacgtggaca agcccactcc acaggtggac 35100 
tggtctgccg gggccgttcg gctcctcacc gggaacacgc cctggcccga gagcggccgt 35160 
cctcgtcgag cgggggtgtc gtcgttcggg atcagcggca ccaacgcaca cctcatcctc 35220 
gaacaaccac cgtcggaacc agcggagatc gaccaatcgg atcggcgggt cactgcgcat 35280 
ccagcggtga tcccgtggat gttgtcggct aggagtctcg cagcgctgca ggcccaagcg 35340 
gctgcgctgc aggcccggct ggaccggggt cctggcgctt ctccgctgga tttggggtat 354 00 
tcactcgcga ccactcgttc tgtgctggac gaacgcgccg tcgtgtgggg tgccgatcgg 35460 
gaggcactgc tgtccaggct ggcagcgctc gccgatggcc ggacggcgcc gggggtgata 35520 
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acgggctctg cgaattccgg tggccgcatc 
tggctgggga tgggaaaggc gttgtgcgcg 
gaagcctgcg acgcgctaag cgcacacctg 
gctgatgagc agatgctcga ccggacgctg 
gtcggcctcc tgggattgct gaggtcgtgg 
tcggtcggcg agttggctgc ggcgcacgcg 
cggttggttg cggctcgggc ccacctgatg 
gcggtcgcca ccagcgaggc ggcggtcgga 
agcatcgctg cgatcaacgg ccccgagtcg 
gtggagctcg caggcgaatt cgatgcccga 
catgctttcc actcgcaccg gatggaaccg 
tgcgtcgagt tcggtgaacc ggtggtgccg 
accaccggac tgatgtgcgc ggccgactac 
ttcggagacg gtgtccgggc gctcgtcggc 
ccggacgggg cgttgtcggc cctggtcgag 
agggtggcgg cgatcccgct gatgcgcagg 
gccctggcgc acgtgcacgt ccgcggtggt 
accggcgccc gcaccgtcga gttgcccacc 
gccgggcaag cggacgggcg cggcggcgat 
tgggagttgg tcgagcgcgc cgatccggaa 
gaccagccct tccgggaggt gctgcccgtt 
gaggccctcg cggattcctg gcgctaccag 
gcagccgccc tccggggcgt gtggctggtg 
ccggcggtcg tcatcgacgc gctgatcgcg 
accgagcagg acctccaacg cagtgcgctt 
cgcaccgagg tgacgggtgt gttgtctctg 
catccgcacc tgtcccgtgg tgtcgccgct 
gcgggtgttt ccgccccgct gtggctggcc 
gacggtccgg ccgatccgga ccacggcttg 
gaacatccgc agtggtgggg tggcctgatc 
cggaacgggt tggtggccgc actcgccggg 
tcatccgggt tgttcgttcg cagagtggtg 
tggcgtagcc ggggaacggt cctcatcacg 
gcacgatggc tggcccggcg gggagctgag 
gaagctcccg gcgcagcgga cctaggggcc 
gtcttggcct gcgatgtgac ggaccgcgac 
acggagtatc cgctgtcggc ggtcgtgcac 
gccgagacga ccttggcgca gttcgccgac 
aacctggacc ggctgcttgg cgggcaaccg 
tcgggagttt ggggagccgg cggccaagga 
gcccttgccg agcgccgacg ggcttgcggg 
tgggcgggtg cgggcatggc cgttcaggaa 
ctggtaccga tggaaccgca gtcggccctc 
gaaaccgcca tcaccgtcgc agatgtggac 
gcccgcccgc gaccactgtt ggaagagatc 
gagaagcacg gtgccggcga gctggggcag 
ggacacctgc tgctggaggt ggtgctggcg 
gcggaggctg tgcaacccga tcggaccttc 



ggattcgttt tttccggtca gggcagtcag 35580 
gctttcccgg cgttcgcgga cgccttcgag 3 5640 
ggcgcggacg ttcggggtgt gctgttcggt 35700 
tgggcgcagt cggggatctt cgcggttcaa 35760 
ggcgtgcggc cggccgcggt gctggggcac 35820 
gctggtgtgt tgtccttgcc ggacgctgca 35880 
caggcattgc ccaccggcgg cgcaatgctc 35940 
ccgctgcttt ccggggtgtg cgatcgggtc 36000 
gtagtgctct ccggcgaccg cgatgtgctc 36060 
gggcttagga ccaaatggtt gcgggtctcc 36120 
attctggacg agtacgcgga aaccgccagg 36180 
atcgtctccg ccgcgaccgg tgcgctggac 3624 0 
tggacgcgcc aagtgcgtga tcctgtccgc 36300 
caaggcgtgg acacgatcgt cgagttcggc 36360 
cagtgcttgg ccgggtccga ccaggctggg 36420 
gaccgcgatg aggtcgagac cgcggtggcg 364 80 
gcggtggact ggtcggcttg cttcgccggc 36540 
tacgccttcc aacgccagcg gtactggctg 36600 
gtggttgccg acccggtcga cgcgcgcttc 36660 
ccgttggtgg atgaactctg catcgaccgg 36720 
ctggcttcct ggcgcgagaa acaacgccag 36780 
gtgcgctgga ggtccgtcga ggtgccgtcc 3684 0 
gtgcttccag ctgacgtgcc ccgagatcaa 36900 
cgcggcgccg aggtcgcggt cctggaattg 36960 
gtggacaagg tgcgcgccgt cattgcggac 37020 
ttggcgatgg acggcatgcc ctgcgcggcg 37080 
accgtgatcc tgacgcaggt gttgggcgat 37140 
acgaccggtg gcgtcgaggc cgggaccgag 37200 
atctgggggc tcggcagggt cgtcggcctt 37260 
gaccttccgg agacactgga cgagacgtcc 3732 0 
acggcggccg aagatcagct cgccgtgcgt 37380 
cgcgcagcgc ggaacccccg gtcagagaca 3744 0 
ggcggaacag gcgcgctcgg tgccgaggtc 37500 
cacctggtgt tgatcagtcg ccgcggcccg 37560 
gagctgactg aactcggcgt gaaagtcaca 37620 
gagctggcgg cggtgctggc ggccgttccc 37680 
accgccggcg tcgggacgcc tgcgaacctg 3774 0 
gtgttgtcgg ccaaggtcgt cggcgcggcg 37800 
ttggacgcct tcgtgctgtt ctcctcgatc 37860 
gcctattcgg ccgccaatgc gtatctcgat 37920 
cggccggcga cgtgcatcgc ctggggtccg 37980 
ggtaacgagg cgcatctccg ccgaaggggc 3 8040 
ttcgcgctgc aacaggccct gtcccaacga 38100 
tgggagcgat tcgccgcctc tttcaccgcg 38160 
gtggatctac ggcccgacac cgagaccgag 38220 
cagctggccg cactgccgcc cgctgagcgc 38280 
gaaaccgcca gcaccctggg gcacgattcg 38340 
gccgaactgg gcttcgattc gctgaccgcg 38400 
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gtagagctgc gcaacaggtt gaacgcggtg accgggcttc gcctgccgcc gacgctggtt 38460 
ttcgaccacc cgacgccgct ggcgttgtcc gaacagttgg ttccggccct ggtcgcggag 38520 
ccggacaacg gcatcgaatc gctgctcgcc gagctcgaca ggctggatac cacgttggcg 3 8580 
caagggcctt cgatcccact ggaagaccag gccaaggtgg cggagcgctt gcacgcactc 38640 
ctcgccaagt gggacggggc gcgtgacggc acggccagag cgacgtcacc ccaatcgctg 38700 
acggcggcca cggacgacga aatcttcgac ctcatcgacc ggaagttccg gcgctgaccg 3 8760 
ccctttcctc gcctcagctc ccctgattac tggaacggtg tatttcgatg gccaatgaag 38820 
aaaagctccg cgagtacctc aagcgtgtcg tcgtcgaact ggaagaggcg cacgaacgcc 38880 
tgcacgagtt ggagcgccag gagcacgacc ccatcgcgat cgtgtcgatg ggatgtcgtt 38940 
atcccggtgg cgtctccact ccggaggagc tgtggcgact ggtcgtcgac ggaggagacg 39000 
cgatcgcgaa cttccccgaa gaccgtggct ggaatctgga cgagctgttc gatcctgatc 39060 
cgggccgagc cgggacctcc tacgtccgcg agggtggttt cctgcgcggg gtcgcggact 39120 
tcgatgccgg gctcttcggg atcagtccgc gcgaggcaca ggcgatggac ccgcaacagc 39180 
ggttgctgct ggagatctcg tgggaggtgt tcgagcgcgc cggcattgac ccgttttctt 3 924 0 
tgcggggtac caagaccggt gtgttcgcgg gcctgatcta ccacgactac gcgtcgcggt 39300 
ttcgcaagac ccccgcggag ttcgagggtt acttcgccac cggcaacgcg ggcagcgtcg 39360 
catccggccg ggtggcttac accttcgggt tagagggccc ggcggtcacc gtggacaccg 39420 
cctgctcgtc gtccctggtg gcgctgcacc tggcctgcca gtccctgcgg ctgggcgaat 39480 
gcgacctggc cctggccggt ggcatttcgg tgatggccac gccgggagcc ttcgtcgagt 39540 
tcagccggca acgcgcactc gcctcggatg gccggtgcaa gcccttcgcg gatgccgccg 39600 
acggcaccgg ctggggcgag ggcgccggaa tgctgctgct ggaacggctg tcggacgcac 39660 
gacgaaacgg ccacccggtg ctggcggcgg tggtcggttc cgcgatcaac caggacggga 3972 0 
cgtccaacgg cctgaccgcg cccagcggtc ccgcacagca gcgagtgatc cgccaagccc 39780 
tggcgaacgc cgggttgtcg cccgccgagg tcgatgtggt cgaggcgcac ggcacgggca 3984 0 
cggccttggg cgacccgatc gaggcgcagg ccctgatcgc cacctacggg gcgaaccggt 39900 
cggcggatca tccgctgctg ctgggttccc tcaagtcgaa catcggccac acccaggctg 39960 
ccgccggtgt ggccggggtg atcaagtcgg tcctggccat caggcaccgg gagatgcccc 40020 
gcagcctgca catcgaccag ccatcgcagc acgtggactg gtcggcgggc gcggtgcggc 40080 
tgctcacgga cagcgttgac tggccggatc tcggcaggcc gcgccgagca ggggtgtcct 4 014 0 
cgttcggcat gagcggtacc aacgcacacc tgatcgtcga ggaagtatcc gacgagccgg 40200 
tctcgggcag taccgagccg accggggcat ttccctggcc gctgtccggc aagacggaga 40260 
cggcattgcg cgagcaggct gccgagttgc tctccgtagt gaccgagcac ccggagccgg 40320 
gactggggga cgtcgggtac tcgctggcca ccggtcgcgc tgcgatggag caccgggctg 40380 
tcgtggttgc cgacgatcgg gactctttcg tcgccggact gacggcgttg gctgcgggcg 40440 
ttccggcagc caacgtggtg cagggcgcgg ccgactgcaa gggaaaggtc gcgttcgtgt 40500 
tccccggcca gggctcgcat tggcagggga tggcgaggga actgtccgaa tcctcgccgg 40560 
tgttccggcg gaagctggcg gaatgcgcgg cggctacggc cccttacgtg gactggtcgc 4 0620 
tgctcggcgt ccttcgcggt gatcccgatg cacccgcgct ggatcgcgac gacgtgattc 40680. 
agctcgcgct gttcgccatg atggtgtcgc tggccgaact gtggcgttcg tgcggagtgg 40740 
agcccgccgc ggtggtcggt cattcccagg gcgagatcgc cgccgcccat gtggcaggcg 40800 
ctttgtcctt gactgatgcg gtgcgcatca tcgctgcccg ctgcgatgcg gtgtcggcgc 40860 
tgaccgggaa gggaggcatg ctcgcgattg ccttgccgga aagcgcggtg gtgaagcgaa 40920 
tcgcaggcct gccggagctg accgttgcgg cggtcaacgg acccggctcc actgtcgttt 40980 
ccggcgaacc gtcggctctg gagcgtctgc agaccgaact gaccgcggaa aacgtgcaga 41040 
cccggcgggt gggaattgat tacgcctcgc attcgccgca gatcgcgcag gtccagggcc 41100 
ggcttctgga ccggctgggc gaagtcgggt ccgaacctgc tgagatcgct ttctactcga 41160 
cggtcaccgg cgagcggacg gacaccggcc gactcgacgc cgactactgg taccagaacc 41220 
ttcggcagcc cgtccgcttc cagcagaccg tcgcccggat ggcagatcag ggctatcggt 41280 
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tcttcgtcga ggtgagcccg cacccgctgc 
ccgcggacgc gggcggggtg gtggtcggtt 
gctggctgac ttcgctggcc gagtgccagg 
tattcctcaa cacc'ggagcc cgacgcgtgc 
ggtactggtt ggagtccgcc gagtacgacg 
ccgccgagca tcccctgctc ggggctgcgg 
tgaccggcaa gctgtcggtc aagacccagc 
cgatcctgct gcccggcacc gcgttcgtgg 
ggtgcgatct gatcgaggag ttgtccctga 
cggtgcaggt gcagatcgcg gttggcggtc 
tgcattcctg tcgagacgac gccgtgccgc 
cgttgacctc cagcgatcac caggacgccg 
acgatgctgt cgcggttccg ctggacagct 
atttcggccc ggcgttccag gggttgcagg 
ccgaggtcgg cctgcccacc gcacaccgcg 
ctctgctgga tgcggcactg caggcgctgg 
ggctcccgtt cgcgtggcaa ggtgtgtccc 
tgcacctcgt tccggcgggc gcgaatgcgg 
aagccgtgct ctccatcgat tcgctggtgc 
cggcccgtgc gatggaacac gagtccctgt 
gcgctgccaa gccggtctcc tgggcagtga 
gctcggcact tggcacggaa ctccaccccg 
acgtcgtggt ggtgccatgc ggtgcgtctc 
gtgccgcgac acaacgcatg cttgacctga 
ccggatctcg cctggtggtt gtgacgtgtg 
tttccgacct ggtgcatgct gcgtcgtggg 
cggaccgatt cgtgttggtc gatgtggacg 
cggccgtgcg ttccggagaa ccgcagctgg 
gcctggcgcg atgtgttgcc gccgaggaca 
cggtgttgat ttccggcggt acgggcctgc 
cggagcgcgg tgtccgccgc ctggtgctcg 
tcaccgacct ggtggatgag ttggtgggcc 
atgtcgggga tcgggcccag ttggaccggc 
tgcgcggagt ggtgcatgcg gccggggcac 
cagagcacgt ggcaaaggtg ttcggcccga 
tgactcttga tctggatctc tcgttcttcg 
gggctgcggg tcagggaaac tacgcggcgg 
accggcggac ggcggggctg cctgcggtgt 
gcgggatgac cggagcgctc gatgcggcgg 
cgatgtccgc gccggacggg ttgcggctgt 
cgcttctggt tccggtccac gtcgacctga 
gtgtgcctgc gttgttgcgc gacctggtgc 
agtcggcgga cgtcaacggt ctggttggtc 
aaacccagct gttgggtttg gtgcgcgagc 
cggtcgaggt cggggccgat cgtgctttcc 
tggagtttcg gaaccggctt ggcggggtgc 
tcgactatcc gacaccgcgg gcgttggttc 
tggaggctcc gactcccgca ccggcggctg 



tcaccgccgg aatccaggaa acgctggaag 4134 0 
cgctgcggcg tggcgagggc ggctcccggc 41400 
tgcgcggact gccggtgaat tgggaacagg 414 60 
cgctgccgac ctacccgttc cagcggcagc 41520 
cgggcgatct cggttcggtg ggcttgctct 41580 
tgacgctggc cgatgcgggc gggttcctgc 4164 0 
cctggttggc cgaccacgtg gtcggcgggg 41700 
aaatgctgat acgcgccgcg gaccaggtcg 41760 
cgactccgct ggttttgccc gcgaccggtg 41820 
cggacgaggc cgggcgccgc tcggtccgcg 41880 
aggactcgtg gacctgccac gcgaccggca 41940 
gccagggccc cgatgggatt tggccgccca 42000 
tctacgcccg cgcagctgag cggggcttcg 42060 
cggcttggaa gcgcggagac gagatcttcg 42120 
aagacgccgg caggttcgga atccaccctg 42180 
gcgcagccga agaggatccg gacgagggat 42240 
tcaaagcgac gggcgcactt tcccttcggg 42300 
tgtcggtgtt cacgaccgac acgactggcc 42360 
tgcgccagat ttcggacaag cagttggcag 42420 
tccgggtcga ctggaagcga atctcgcccg 42480 
tcggcaatga cgaactcgcc cgagcctgcg 42540 
acctgaccgg gttggctgac ccgcccccgg 42600 
gccaggactt ggacgttgct tccgaggcac 42660 
tccaggattg gttggcggcg gcgcgattcg 42720 
gtgcggcgtc gacaggtccc gccgagggtg 42780 
gtttgttgcg ttcggcgcag tcggagaacc 42840 
gaaccgccga atcatggcgt gcgctcgcgg 42900 
cgttgcgcgc cggtgaagtc cgggtgcctc 42960 
gccggatccc agtgcccggt gcggatggga 43020 
tgggcgggtt ggttgcccgg catttggtgg 43080 
cggggcgacg cggctggagc gcccccgggg 4314 0 
tgggagctgc ggtcgaggtg gcgagctgcg 4 32 00 
tgctgacgac gatctcggca gagttcccgc 4 3260 
ttgccgacgg ggtcgtcgag tcgctgacac 43320 
aggccgccgg tgcgtggcac ctgcacgagt 43380 
tgctcttctc ctcgttctcc ggcgtggcgg 43440 
cgaacgcgtt cctggacggc ctggctcagc 43500 
cgctggcttg gggcttgtgg gagcagccca 43560 
gccgtagccg cattgcgcgc accaatccgc 4 3620 
tcgagatggc gtttcgcgtt ccgggcgaat 4 3680 
acgccctgcg cgctgatgcg gccgacggcg 4374 0 
cagcgcccgt gcggcggagc gcggtcaacg 43800 
ggctgcggag gctgccggac ctggatcagg 43860 
atgtttcggc ggtgctgggg cattcgggtg 4 3920 
gggatttggg ttttgattcg ttgtccggtg 43980 
tgggcgttcg gttgccggct actgcggtgt 44040 
ggttcttgct cgacaaactg attggtggcg 44100 
tggcggcggt gactgctgac gatcccgttg 44160 
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tgatcgtggg gatgggctgt cgttatccgg gtggggtgtc ctcgccggag gagctttggc 44220 
gtttggtggc cgggggcttg gatgcggtgg cggagttccc ggacgatcgt ggctgggatc 44280 
aggcggggtt gttcgatccg gatcccgatc gtcttgggac ctcgtatgtg tgtgagggtg 44340 
gcttcctgcg agatgcggca gagttcgatg ccggtttctt cgggatttcc ccgcgtgagg 44400 
cgttggcgat ggatccgcag cagcggttgc tgctggaagt cgcttgggaa accgtggagc 44460 
gggcggggat tgatccgctt tcgttgcggg ggagccggac cggcgtgttc gcggggctga 44520 
tgcaccacga ctacggcgcg cggttcatca cgagggcgcc ggagggtttc gagggttatc 44 580 
taggtaatgg cagcgcggga ggcgtgtttt cgggtcgggt tgcgtattcg tttggtttcg 44640 
agggtcctgc ggtgacggtg gatacggcgt gttcgtcgtc gttggtggcg ctgcacctgg 44700 
cgggtcaagc actgcggtct ggtgagtgtg atctggctct tgcgggtggt gtgacggtga 44760 
tggccacgcc ggggatgttc gtggagtttt cgcgtcaacg gggcttggcg gcggatgggc 44820 
ggtgcaagtc gtttgcggcg gctgcggatg gcaccggttg gggagaaggc gcgggcttgg 44880 
tgttgttgga gcggctgtcg gatgcccggc gcaacgggca cgcggttctg gcggtcgtgc 44940 
ggggtagcgc ggtgaatcag gatggtgcgt cgaatggttt gacggcgccg aatgggccct 45000 
cgcagcagcg ggtgatcacg caggcgttgg cgagtgctgg tttgtcggtg tctgatgtgg 45060 
acgccgtgga ggcgcatggg actggaacca ggcttggtga tccgattgag gcgcaggctc 45120 
tgat'tgccac ttacgggcag gggcgggata gcgatcggcc gttgtggttg gggtcggtga 45180 
agtcgaatat tggtcatacg caggcggcgg cgggtgtcgc tggtgtgatc aagatggtga 45240 
tggcgatgcg gcacgggcag ctgcccgcga cgttgcatgt ggatgaacct acgtcggaag 453 00 
tggattggtc ggcgggggat gtccagctcc tcacggagaa caccccctgg cccggcaaca 45360 
gccatcctcg gcgggtgggc gtgtcgtcgt tcgggatcag cggcaccaac gcacacgtca 45420 
tcctcgaaca agcctcgaaa acaccagacg agactgcgga caagagcggt cccgattcgg 454 80 
aatcgaccgt ggaccttcca gcggtcccgt tgatcgtgtc ggggagaaca ccggcagcgc 45540 
tcagcgctca ggcgagcgca ttgttgtcct atttgggtga gcgtggcgat atttccacgc 45600 
tggatgcggc gttttcgttg gcttcctccc gggccgcgtt ggaggagcgg gcggtggtgc 45660 
tgggagcgga ccgcgaaacg ttgttgtccg ggttggaagc gctggcttcc ggtcgcgagg 45720 
cttctggggt ggtgtcggga tccccggtct ctggcggggt tgggttcgtg ttcgccggtc 4 5780 
agggcggaca gtggttgggg atgggccggg ggctctactc ggtttttccg gtgttcgctg 45840 
acgcgtttga cgaagcatgt gccggactgg acgcgcatct ggggcaggac gtgggggtcc 45900 
gggatgtggt gtttggttcc gacgggtcct tgttggatcg gacgctgtgg gcccagtcgg 45960 
gtttgttcgc gttgcaggtt ggtttgctga gcctgctggg ttcgtggggt gtccggccgg 46020 
gtgtggtgct gggccattcg gtcggcgagt tcgcggcggc ggttgcggcg ggagtgttgt 46080 
cgttgccgga tgcggctcgg atggtggcgg gtcgtgcccg gttgatgcag gcgttgcctt 4 6140 
ctggcggtgc catgttggcg gtggctgctg gtgaggagca gctgcggccg ttgttggccg 4 62 00 
atcgggttga tggtgcgggt atcgccgcgg tcaacgctcc tgagtcggtg gtgctctccg 46260 
gcgatcggga ggtgcttgac gacatcgccg gcgcgctgga tgggcaaggg attcggtggc 46320 
ggcggttgcg ggtttcgcat gcgtttcatt cgtatcggat ggacccgatg ttgcaggagt 4 6380 
tcgccgaaat cgcacgcagc gtggactacc ggcgtggcga cctaccggtc gtgtcgacgt 4 644 0 
tgacgggtga gctcgacacc gcaggtgtga tggctacgcc ggagtattgg gtgcgtcagg 4 6500 
ttcgagagcc cgtccgcttc gccgacggcg tccgggtgct cgcgcagcaa ggggtcgcca 4 6560 
cgatcttcga actcggccct gatgcgacgc tgtcggccct gattcccgat tgtcattcgt 4 6620 
gggctgatca ggccatgccg attccgatgc tgcgtaaaga ccgtacggaa accgaaactg 46680 
tggtcgccgc ggtggcgcgg gcgcacacgc gtggtgttcc ggtcgaatgg tcggcgtatt 46740 
tcgccggcac cggggcacgg cgggtcgagt tgccgacgta tgccttccag cggcagcggt 4 6800 
actggctgga aacatcggat tacggcgatg tgacgggtat cggcctggct gcggcggagc 4 6860 
atccgttgct gggggccgtg gttgcgctgg ccgatggtga tgggatggtg ctgaccggcc 46920 
ggttgtcggt ggggacgcat ccgtggctgg cccagcatcg cgtgctgggc gaggtcgtcg 4 6980 
tccccggcac cgccatcctg gagatggccc tgcacgcagg ggcgcgtctc ggctgtgacc 47040 
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gggtggaaga gctcaccctg gaaacaccgc 
gtagtcgtgg ccctgcggga gggaccacag 
ggacgaacga cgccatcgaa atccagctgc 
gaagggtgtc gctgtattcc cgcccggccg 
acgccaccgg cgaactcgtc gtcggcacca 
ctgagggtgc cgagtcgatt gctctcgatg 
tcgagtacgg gccgttgttc caggggcttc 
tcgccgaaat cgccccgccg gccgaggccg 
cagcgttgct ggatgccgcg ctgcaggcgt 
gcgcgtggct gccgttctca ttcaccggcg 
gcagggtgcg gctggagacc aggcgacccg 
gtgggcggtt gctcgcctcg atcgattctc 
tggcgaatcg ggacgctgtc cgcgacgcgc 
cgcagtcgac ggaaccgggt cgctgggccc 
atctcatcaa actcgcaacg gattccgccg 
agaaacttga ttccagcgcg ctggttcctg 
cggatcccgg caccggcgca gccgcacttg 
aagcgtggtt ggctgagccg cggttggccg 
cggtgacgac ggctccgagt gacggtgcat 
tgttgcgtgc cgcgcaggtg gagaacccgg 
ccgccgaatc gtggcgtgcg ttgccgagtg 
tgcggaaggg, cgcggtgcga gcgccccgct 
ccgcggttgt ggcggatccc gaccgaaccg 
ggggcgcggt tgcccgccac ctggtgaccg 
gccgtcgtgg ctgggatgct cctggaatca 
gtgccgtggt cgacgtggtg gcgtgcgacg 
tggcggcggt cccggcggaa tttccgttgt 
ccgacggggt gatcgagtcg ttgtcaccgg 
cggcgggggc gtggaatctg cacgagctga 
tgttctcctc gctttccggt gttgccggcg 
acgcgttcct ggacgcattg gcgcattacc 
tggcctgggg cctgtgggag cagccgagcg 
ggagcaggat cgcgcgcgcc aacccgccgt 
atgccgggct ggcgctggac cgggcagcgg 
tggccgagca ggcgcggtcg ggctcgctgc 
tccgtcgtaa taggcgggct agcggaaccg 
tggtgcggga gcatgccgcg gccgtgctgg 
agcgcgcttt ccgggatctg ggttttgatt 
ttgccggggt gctgggggtg cggttgccgg 
gggcgctggc ccggttcctg caccaggaac 
cggtgacgac gaccagggca ccggtcgccg 
gatgccgttt tcccggtcag gtgtcctcgc 
gcgtggatgc ggtcgcggac ttcccagccg 
atccggaccc ggaacgggct gggaagacct 
ccgatcggtt cgatgcgggt ttcttcggga 
cgcagcaacg gctgttgctg gagctgtcct 
cgggttcgct gagggggagt cggaccggtg 
gcgcccggtt cgccagccga gccccggaag 
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tggtggtccc cgaacgcgcg gcgggtgccg 47100 
tttcaattga aactgcggaa gaacgtgtgc 47160 
tggtgaacgc acccgacgaa ggcggtcggc 47220 
gtgggtcgag aggtgggggt tggacgcgcc 47280 
ccggtggtag ggcggttcct gattggtcgg 47340 
agttctacgt cgctctggcc ggaaacgggt 47400 
aggcggcatg gcgtcgtggt gacgaggttc 47460 
atgcgatggc gtcgggatac ctgctcgacc 47520 
ccgcgctcgg cgaccgcccg gagcaaggcg 47580 
tcgaactttc cgctccggca gggacgatca 47640 
acgcgatatc ggtggccgtg atggatgaga 47700 
tcaggctacg aagcgtgtcg tcgggacagc 47760 
tgttcgaggt gacctgggag ccggtggcga 47820 
tgcttggtga. tactgcctgc ggtaaagacg 47880 
accgctgcgc ggatctggcg gcgctagccg 47940 
atgtcgtggt ctactgcgcc ggagaacagg 48000 
cggagaccca gcagacgttg gctctgctcc 48060 
aggcacgtct ggtggtggtg acgtgtgcag 48120 
cagagctggc acatgcgccg ttgtgggggt 48180 
ggcagtttgt gctggcggac gtcgacggaa 48240 
cgttgggctc gatggaaccg cagttggccc 48300 
tggcttcggt cgccgggcag atcgacgtgc 48360 
tgctgatttc gggcggcacg ggcctgttgg 48420 
aacgcggtgt ccgccgattg gtgttgacgg 48480 
ccgagttggt gggtgagctg aacggcctcg 48540 
tcgcggatcg tgctgatctg gagtcgttgc 48600 
gcggcgtggt gcatgccgcg ggggcgctgg 4 8660 
acgacgtggg agcggtgttc ggcccgaagg 4 8720 
ctcgtgatac ggacctgtcg ttcttcgcgt 48780 
ctcctggtca gggcaattat gcggcggcga 4884 0 
ggcggtcaca gggactgcct gcggtgtcgc 48900 
ggatgacgga gacgctcagc gaggtcgacc 48960 
tgtccaccaa ggagggattg cggctgttcg 49020 
tagttccggc gaagttggac aggactttcc 49080 
ccgcattgtt gacggcactg gtacccccca 49140 
agctcgcgga cgagggcacc ctgctcgggg 49200 
ggtattcgag cgcggctgac gtcggggtcg 49260 
cgttgtctgg tgtggagttg cggaaccgcc 49320 
cgactgcggt gttcgactat ccgacgccga 49380 
tggcagacga gatcgctacg acgccagcgc 49440 
aagacgatct cgtcgcgata gtcgggatgg 49500 
cggaggagct ctggcgtttg gtggccgggg 49560 
atcgcggctg ggatctggca ggcttgttcg 49620 
acgtgcggga aggggccttc ctcaccgacg 49680 
tttccccgcg tgaggcgttg gcgatggatc 49740 
gggaggccat tgaacgggca gggatcgatc 49800 
tgttcgcggg gctgatgtac cacgactatg 49860 
gtttcgaggg gtatctcggc aatggcagtg 49920 
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ctgggagtgt cgcgtcgggc cggattgcgt actcgtttgg tttcgagggt cctgcggtga 49980 
cggtggatac tgcgtgttcg 50000 



<210> 6 

<211> 29736 

<212> DNA 

<213> Saccharopolyspora spinosa 



<400> 6 

tcgtcgttgg tggcgttgca tttggcgggt 
gcccttgccg gtggtgtgac ggtgatgtcg 
cagcggggcc tggcaccgga cgggcggtgc 
ggttggggtg agggtgctgg tttggtgttg 
gggcatcggg tgttggcggt ggttcgtggg 
ggcttgaccg cgccgaatgg tccctcgcag 
gcgggtctgt cggtgtccga tgtggatgcc 
ggtgatccga ttgaggcgca ggctctgatt 
cggccgttgt ggttggggtc ggtgaagtcc 
gttgccggtg tgatcaagat ggtgatggcg 
cacgtggatg caccctcctc gcaggtggat 
gagaacacgc cctggcccga cagtggtcgc 
atcagcggca ccaacgcgca cgtcatcctg 
gcggagccgg attcgagtcc tgttctggat 
aaaacacccg aagcgctatc cgcccaggcg 
gttgatgtct cccctctgga cgttgggatt 
gagcgggcgg tggtgctggg gtcggatcgt 
gctgccggtc atgaggctgc tggcgtggtt 
gggtttgtgt tcgccggtca aggcggtcag 
gagtttccgg cgttcgccgg tgctttcgac 
gggagggaag tcggggttcg ggatgtggtg 
actttgtggg cgcagtcggg tttgttcgcg 
acgtggggtg ttcggcccag cgtagtgctg 
ttcgccgcag gtgtgctgtc gatggcggag 
ttgatgcagg cgttgccttc tggcggtgcc 
gtcggcccgc tgctcgatgg ggtgcgggat 
gggtcggtgg tgctttccgg tgaccgggat 
gggcaaggta tccggtcgag gtggttgcgg 
gatccgatgc tggcggagtt cgccgagctc 
ctgccgattg tctcgacgct gaccggaaac 
gagtattggg tgcgccaggt gcgagagccc 
gtggaccaag gcgtcgacac gattgtggaa 
gttcaagagt gtgtggcgga gtccgggcgg 
cgtgatgagg tccgaacggt gctggacgct 
gtggactggg ggtcattttt cgctggtacg 
gccttccaac gacagcggta ctggctggag 
ggcctgaccg gggcggagca tccgctgttg 
gaggtgctgc tgaccggcag gctgtcggtg 



cagtcgttgc gttccggcga atgcgatctc 60 
acgcccggga cgtttgtgga attctcccgt 120 
aagtcgttcg cggagagcgc ggacggtacc 180 
ttggagcggt tgtcggatgc tcggcggaat 240 
tcggcggtga atcaggatgg tgcgtcgaat 300 
cagcgggtca tccagcaggc gttggcgagt 360 
gtggaggcgc atgggaccgg gaccaggttg 420 
gctacgtatg ggcgcgatcg tgatcccggt 4 80 
aacatcggtc atacgcaggc ggcggcgggt 540 
atgcggcacg ggcaacttcc gcgcacgctg 600 
tggtcggcgg ggagggtcca gctcctgacg 660 
ccctgtcggg tgggggtgtc gtcgttcggg 720 
gaacagtcca cggggcagat ggatcaggca 780 
gttccggtgg tgccgtgggt ggtgtcgggc 840 
gcaacgttgg cgacctattt ggaccaaaat 900 
tcgcttgcgg tgacccgttc ggcgctggat 960 
gacacgttgt tgtctggcct gaatgcgctg 1020 
acgggacctg tcgggattgg tggccggacc 1080 
tggttgggga tgggccgccg gttgtactcg 1140 
gaagcatgcg ccgagctcga tgcgaacctg 1200 
ttcggctccg acgagtcctt gctggatcgg 1260 
ttgcaggtcg gtctctggga attgttgggt 1320 
gggcattcgg tcggggagct agcc'gcggcg 1380 
gcggctcggc tggtggcggg tcgtgcgcgg 144 0 
atgctggcgg tgtccgcgac cgaggcccga 1500 
cgtgttggtg tcgcagcggt taacgctccg 1560 
gtgctcgatg gcattgccgg tcggctggac 1620 
gtttcgcacg cgtttcattc gcatcggatg 1680 
gcacggagcg tggactaccg gtctccacgg 174 0 
ctcgatgacg tgggcgtgat ggctacgccg 1800 
gtccgcttcg ccgacggtgt ccaggcgctt 1860 
ctcggtccgg acggggcgtt gtcgagcttg 1920 
gcgacgggga ttccgttggt gcggagagac 1980 
ttggcgcaga cccacactcg tggtggcgcg 2 04 0 
agggcaacgc aagtcgacct tcccacgtat. 2100 
ccatcggatt ccggtgatgt gaccggtgtt 2160 
ggtgccgtgg tgccggtcgc gggcggcgat 2220 
gggacgcatc cgtggctggc ggaacaccgc 2280 
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gtgctgggcg aagtcgtcgt ccccggcacc gcgttgctgg agatggcgtg gcgggccggt 2340 
agccaggtcg gttgtgaacg tgtggaggag ctcaccttgg aggcaccgct ggtcctgccg 2400 
gagcggggcg ctgcggcggt gcagttggcg gtgggggctc cggatgaggc cggccggcgc 2460 
agtttgcagc tctattcccg aggcgctgat gaagacggcg actggcggcg ' gattgcctcc 2520 
gggctgttgg cccaggccaa tgcggtgccg ccggcggatt cgacggcatg gccgccggac 2580' 
ggcgccgggc aggtcgatct ggcggagttc tacgagcgcc tcgccgagcg cggcttgacc 2640 
tacggtccgg tattccaagg gctccgcgcc gcatggcggc acggcgacga tatcttcgcc 2700 
gaattggccg ggtcaccaga cgcctcgggt ttcggcatcc acccggcgct gctggacgct 2760 
gcactgcacg cgatggcgct tggtgcttcg cccgactcgg aagcgcgtct gccgttttcc 2820 
tggcgtggcg cccagctgta ccgcgctgaa ggagcagcgc ttcgggtacg gctctcgccg 2880 
ctgggctccg gtgcagtctc attgacgttg gtggatgcca cagggcgacg agtcgctgcg 2 94 0 
gtggaatcgc tttcgacgcg accggtctcc accgaccaga tcggtgccgg tcgcggcgat 3 000 
caagagcggc tgctgcacgt cgagtgggta aggtcggctg aatctgcggg gatgtctctg 3 060 
acctcctgcg cggtggtcgg tttgggcgaa ccggagtggc acgctgcgct gaagaccact 3120 
ggtgtccaag tcgagtccca tgcggacctt gcttcgttgg ccaccgaggt tgccaagcgg 3180 
ggttcagctc ctggtgcggt catcgtcccg tgcccgcgac cccgagcgat gcaggagctg 3240 
ccgaccgccg cgcgaagggc gacgcaacag gcgatggcga tgctgcagca atggcttgcc 3300 
gatgaccggt tcgtcagtac gcgcctgatc ctgctgacgc atcgggcggt ctccgcagtt 3360 
gctggagaag acgtgctcga cctggtacac gcgccgctgt ggggcttggt ccgcagcgcg 3420 
caagcggagc acccggaccg attcgccttg atcgatatgg acgacgagcg agcatcgcag 3480 
acggcactcg ccgaagcgct gactgcggga gaagcgcagc tcgcggtgcg gtcgggagtt 3540 
gtgctggcgc cccgcctcgg ccaggtgaag gtgagtggag gtgaagcgtt caggtgggat 3600 
gaaggcaccg tgctggtcac cggcggaacc ggcgggctcg gggccctgct cgcacgccat 3660 
ctggtcagcg cccacggtgt gcggcacctg ttgctcgcaa gtcgccgtgg tctggcggcg 3720 
cccggagcgg atgagctggt ggccgagctg gagcaggccg gcgccgacgt cgcggtcgtc 3780 
gcgtgcgact cggcagatcg ggactcgctt gcgcggctgg tggcgtcggt gcctgcggaa 3840 
aacccgttgc gggtggtggt gcacgccgcc ggtgtgctgg atgacggtgt gctgatgtcg 3900 
atgtcgccgg agcgcttgga cgcggtgttg cggcccaaag tggatgccgc gtggtacctg 3 960 
cacgagctga ctcgggaact cggtctgtcg gcgttcgtgt tgttctcctc ggtcgcgggc 4 020 
ctgttcggcg gtgcggggca gagcaattac gctgccggca acgctttcct ggatgccttg 4 080 
gcgcattgcc ggcaggccca ggggctgccc gcgctgtcgc tggcctccgg gctgtgggcg 4140 
agtatcgatg gaatggcggg cgacctcgct gcggcagatg tggagcggct gtcgcgggca 4200 
ggcattggcc cgctttcggc accgggaggg ctggccttgt -tcgacgctgc cgttggctcg 4260 
gacgaaccgt tgctggcacc ggtgcgactg gatgtcgaag cactgcgtgt gcaggcccga 4320 
tccgtgcaga cccggattcc ggaaatgctg catggcatgg caatggggcc aagccgccgc 4 380 
actccgttca cttccagggt tgagccgttg cacgaacggc tggccggatt gtcggagggc 4440 
gaacgtcggc agcaagtgct ccagcgcgtc cgcgccgata tcgcggtggt actggggcac 4500 
ggcaggtcga gcgatgtgga catcgagaag cctttggccg agctgggttt cgactcgctg 4560 
acggccatcg aactccgcaa ccgtctcgct accgccaccg gactgcggct tcccgcgacg 4 620 
ctggccttcg accacggcac tgcggcggca ctcgcccagc acgtgtgcgc gcagctaggc 4680 
accgcgaccg cgccggcacc gaggcgaacc gacgacaacg acgccacgga gcccgtgagg 4740 
tcgctcttcc aacaggcgta tgcggctggc cggatacttg acgggatgga tttggtgaag 4 8 00 
gtcgctgccc agttgcgacc ggtgttcggt tcgcctggcg agctggaatc cctgccgaaa 4 860 
cccgtccagc tttcccgtgg tcccgaagag cttgccttgg tgtgcatgcc ggcgctgatc 4 920 
gggatgccgc ccgcacagca gtacgcgcgg atcgccgccg ggttccgcga tgtgcgggac 4980 
gtttcggtga tcccgatgcc tggattcatt gcgggagaac cgctgccgtc cgccatcgag 5040 
gtggcggttc ggacgcaggc ggaggcggtg ctgcaggaat tcgccggggg ctcgttcgta 5100 
ctggtcgggc attcctccgg gggctggctg gcgcacgagg tagccggtga gctggagcgt 5160 
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cgcggggtcg tcccggccgg ggtcgtactg 
ccgaggttct ccgtggcgat ggcccaccgg 
atgcaggatg tcggtatcac .cgcgatgggc 
ccgacgccga tcggtgctcc gacgctgttc 
gaagggcggc cgtggacaga tgactcctgg 
gtccaggtgc cgggcgacca cttctcgatg 
gcagtcgcga gttggcttga caaactcaac 
tccttttagg accttctggg cggcaccggc 
ggctcgccga tcttgacggc gcacgatgcg 
cgctcgtggc catcggcctg gcgaatcatg 
cgagcccgca ttccgaggtg aggggcaccc 
tcacctcgaa attcgtccga tttgggcggt 
ttcttgttga cagagcggtg agaagccgct 
tgctcaagcg ccaccgtttg acgaccgcca 
tcagcggctg cggaaccgcc gccgcacttc 
gcaatgttgg tatggcctcg ggcgggggcg 
aggccgattt cctcgccacc gcgacaccgg 
cgtacgggaa ccggtcggac aagacctgca 
gcgtggacat gagcaactcg ccgatcgagg 
acgcgcccaa ggaattcacc ctccagcccg 
tcctggccga cagcggcgac ccgaacgccc 
cggacatgtc cgaggcccag ccggtcaacg 
cgaagtacct gcgagtcagc tcgctggtgt 
gtgagtcccg cagattcgac ctcgccgtgc 
tcgttgtgac cagcgttgtt cacgtccggg 
ggcgcctcca acggggcctg gcatccgggg 
ctgtcgggcg ttgtcacgcc gccggcctcg 
ccagcgcggt gtggcggcgg ccggagccga 
tcgaaccgac gaggaccacg accttcttgg 
gctgcagcat catccaggcg cttgtggtca 
cctgcgaacg atggatggtc accgcgtagg 
cgtagtcgat gtcctcgtcc tcgtcggttc 
gggcggacac gacgccctgc gtgccgttga 
tgcggatctg cgtgaccttg tcgccgacgc 
gcaggccctc cctggccggg gtgatcgctt 
ctgcggggcc tcgatgcatc ggggcgagga 
acttccgcgg aatccggcgg gcgacgacgt 
cctctacgtg gaacaggaag aagtcggtca 
tgattcggtg cgcgttggtc accaccccgg 
gccgcacgtg tggaatcggg gtgccagggg 
cgaccgacgg gagctggtcg acgtcgccga 
ccttggccag tttgttggct aacagcaggt 
ggtcggcgtc cagcgggttg tcccggtcgt 
gcaggcggtg cacggtcgcc gcgtcgtgtc 
gtcccgtcgg cgcggcgagg atcaccttgg 
accgcacggt gaagctcttg ccgcagcctg 
tcagggccag cttgacggcg cgctcctgcg 
gcaaccagtc gagggccttg tgccaatcga 



ctggacacct acatccccgg tgagatcacg 5220 
acgtatgaga agctcgcgac tttcacggac 52 80 
gggtacttcc ggatgttcac cgagtggact 5340 
gtgcggaccg aagattgcgt cgcagaccct 54 00 
cggccagggt ggactctcgc ggatgccacg 5460 
atggacgagc acgccgggtc caccgcacag 5520 
cagcgcaccg ctcggcaacg ctgacgggcg 5580 
caccccggcg gtgccgcctt ccgtggtcca 5640 
cggcacgcgc gctgatcgtg attccgctgc 5700 
tcctttcggg caacgtcaaa cgaattcgtc 5760 
ttgggtggct gagccgctca agggtgcccc 5820 
ggacgcaacc ccggtgggcg tggtgcgtct 5880 
gacacacctg agaggaaaag gggagcatga 594 0 
tcaccggcct tctgggggga gtactgctgg 60 00 
agtcctcgcc ggcgcccggg catgacgcgc 6060 
gcggggacat cggcacgtcg aactgctcgg 6120 
tgaaaggcga ccccggcagt ttcatcgtgg 6180 
cgatcaacgg cggcgtgccg aacctcaagg 6240 
acctgccggt cgaggacgtg cggcttcccg 6300 
gtcagagcgc gtacgccggc attggcatgg 6360 
atgtcctcac cgggttccag tcctcgctgc 6420 
ttctcggcga cggcaacgtg aagttcgccg 6480 
ctaccgcaga cgagctgcgc taaaacccat 654 0 
ggcgcctccg gcgaagcgtc cgtacgtttg 6600 
cgcagcgctg gtacatactc aggcgtctcg 6660 
ccgtcgagtg cggcggcgct gacgcgttct 672 0 
aaccggtccc gccccgtcgg agccggtggt 6780 
cggtgcgcac cgcctgcccg agggcctttt 684 0 
cccgggtgac cgccgtgtag agcaggttgc 6900 
aggggatcac cacgcacggg tattcgcttc 6960 
cgtggaccag ttcgtcgagt tctgtgaagt 7020 
gcacggtcat ggtctgtgct tcgttgtcga 7080 
acacgccgtt ggcgcccttg tcgtagttgt 714 0 
ggaagatccg tccgccgaac cgccgctctg 7200 
cctgcaacag ctggttcagc gcgcctgcac 7260 
cctgcacgtc ggtgcgcggg ttgaaccgga 7320 
cgacggtgag ctcggcggtc ggttcgcttt 7380 
gcccgtgtgt cagcggatag tccccggcgt 744 0 
actcggcggc ctgccggaac accftcgttga 7500 
cgagcagatc gcgcagtacc tcaccggctc 7560 
ccagcagcag gtgcgcgccg ggcgcgatcg 7620 
cgagcatgga cgcctcgtcg accacgacga 7 680 
aggcggcgtc cccgcccggc tggagttgga 7740 
cggtgagctc ggtcagccgc ttcgccgctc 7 800 
cctttttcgc ctgagctaat gcgatgatcg 7860 
gacctccggt gagcacggcg accttctcgg 792 0 
cctcggcgag ttcggcaccg gtagcgcggc 7 980 
cgtcggcgaa gacgggcatc cggtccgcgc 8040 
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tggtgttcag cagccgggac agctggttgg 
cgaggtagat cgcgaccgtc ggcacctcgt 
caccttcctc ggtgacgagt tcggcgaggc 
ggatcttcac cgcctcggcg atcagctcgt 
tggactccga cagcgtgaac tgaaggcccg 
ttcccaccgc tttggcgatg gtgtcggcgg 
ccagccggta tggctcttcc ttgacggtcc 
tcttcaccgc cagcgaggtc gagacgccga 
tcgccttctg ctcctcccac gcgtcggcga 
ggacctcgat cagccgcgcg ggttcctgct 
ggtcgacgat cttctcggcg agtttggggc 
agcggcggat accttgcacg gtcgcaggca 
gccgcccgta ctgggggtgc gacccccacc 
gcgcgcccag cagcgcgccg acgaccgtca 
gcgcgacggt gtagccgctc tcctcgttgg 
gcaccgcagt ccacgtggcc gactcccgtc 
ggctgtcaag caaaccggcg gtcaccacat 
cggcgtcagc gggtgggcgt tcgcgatgcc 
ttcgtcctcg cgcagggcct ggagtaggtg 
ggataaaccg gcttgacgga ggagatggaa 
gacgaccccg agtcatgcgt cgacgacctg 
acgtgggggc gtgcgatcag gtcgcggcgc 
agccgcgacg gactggttcc gctcctcgag 
cgatgccggc cgcgttcgcg aggtggccag 
tgccttttga acatctggtc aattcagctt 
ccgccaattg tttgctcgcc gtgcgccctg 
gctggcgaag gtgtcgattg gctcggacgg 
cacgagcgtc ggcgatgccc ggtactatct 
cgggcgcggt gcgctcctga tcatgaatca 
tcgtggggca ctgtgaggat gtgcatcaca 
atcgcgggct gccgcggctg tgcccagtcg 
gtcagcccct gcgcgtcgag tgcgactttg 
tcggttgccg gtgcgttcgc ctgcaccgac 
tcggtgacga ccgagccatt cgcggtgcgg 
aactccgcct gctgcggtcc gttcgcgtcc 
caggcgtggt cgttgttgtt catcgactgg 
aacgtcagac cggtgcggcc cagcgaggtc 
tagaactgcg gcctgcgtag gaacaggaag 
aagtgcgcct gttggtatgg aagtttcgcg 
tagcgctcgc cgtagatcgc gtgggtgagg 
gcgggcaccg gtgccgggtc ctcggcccag 
.ctggacagga acgcgcccaa gttcggcacc 
cgttccgggt ccggtcgcac gtcggcgtag 
acgttggcga accagccgga gccgtcgggt 
gtgtacttcc gcgccatcgt gacgaggttg 
atctccggca gcatcacctc gaagttgtag 
tagcccgaag gactctgccc gttcgcggca 
cgcagcgcgg ggtcggggtc ctggttcagc 
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acctggttcg cgtagtggat tgtgggctgc caggcgctac cgttgcccgg gttgaggagc 10980 
cacgtcatcg ctttgcgcag cgccgaggtg atctgcgact ggcgttgcgg caggatgttc 11040 
gccgcgcgca gcagggcgtt ggtcttgctc aggtacccca acccgaagcc ggtggcggcg 11100 
agcccgtgct cggtcggcga atactccggc cacgagccgt catcgtgctg caaccctagg 11160 
tagtgtccga gcccggcgtc gagcgcggcg agcagagtcg cgtcgccccg gtacgggttc 11220 
cacgtgcggg attgcgtggc gaaccaggcg agcgtgtaga cgtgctcctg gacgcgggcg 11280 
ttgtaggaca ccgccgggct gcgccaccag cctccggcga agaagccgct cgagtccatg 11340 
tcggcgacca tcggcgagac cgcggtgagg tacgacgcga atcgctgctc ctcgggtgcg 114 00 
aacaggcgcc ggttcggtgt ggtgggcggc ggggcgggca gtgcgagtgc gcgggtcggc 114 60 
agtgcggcga gcagtccgag agcggcggcg ccggtcatca ggctgcgacg gctgaacgta 11520 
gtcacgggcc tacctccttg tggccgatca accctcaccc gctgcgtagc cgcacgtcaa 115B0 
gatgataatt cgaattatta tgggcttgac gacgcgtagg ccgacgacgc agaattcctg 11640 
ccaattcgta ttggcaagcg ggggtgctcg tggcccgacg gctcacccag caggacatcg 11700 
cccggatggc aggagtcagc caggccacgg tgtcgctggt gctcaacaac cggaaggacg 11760 
gcaacgtccg gatcgcggcg gagacccgtg cgcaggtact ggaggtgatt cggaagaccg 11820 
gctacgtcgc gaacccgatc gcccgcagga tgcgcgatcg gcacaaccgc atcct.cggcg 11880. 
tgttcaccta cgaggcggtg ttcccgagca cccacgcgaa cttctaccag tcgttcctcg 11940 
aaggcatcga ggaacaggcc gaggaggtcg gctgtgacct gttgttgttc accagcgcca 12 000 
aggccaccgg ggagcggcgg cggattttcg gcgacgacag ccgggtgcgg ctcgccgacg 12060 
gcagtctgct gctcggtcgc acggtcgacc gcgacgacct gacccagctg ctcgccgaaa 12120 
gcatcccgta cgtctccatc ggacgacgcg acgacgcggg cggtccggtg ccgcacgtcg 12180 
gggccgacta ccgcaccgcg gtgcgagacc tggtggaccg cgcggtcgcg ctcggccacc 12240 
gcgggttcgc gtacgtgggg tctggtgggg gcgcggagtc gtccgcggat cggctgcgag 12300 
gcttccgcga agccgttgcc gcacatggcg tccaagggat gcatgtggag accccacagc 12360 
tcgatcagct gcgcgaagcg ggcgtcaccg ctgtgctcac cgaagaggtg tcggacgggg 12420 
ccgcgctcgt gctcgccggg cgcgaacgcg ggctctccgt gccgggcgac ctcgccgtgc 12480 
tctcgctcgg tgccgctacc cggtcggcac cggacgacga cgtggcgctc accggtttcc 12540 
gcatccccag gcgcgagatg gggcgccggg cggtgcaggc gctgaccgag gtgctcgaaa 12600 
acggcaccac accgcaagaa ctgctcccgt gcgagttcgt cgagggctcg acgctgggcg 12660 
caccacgcct ttgaccagga ggaactgttg ctcgaccaca ccacggacgt tgtcgtcgtt 12720 
ggcggcggac tcggcggtgt- cgccgccgca ctcgcgttgc tgcgcgcggg ccgccgggtc 12780 
gtgctcaccg aggagtacga ctggctcggc ggccagctga ccagccaggc cgtgccgccc 12840 
gacgagcaca gctgggtgga gcgcttcggc gtcaccgcga gctaccgggc gttacgcgac 12900 
ggcatccgcg actactaccg ccgccactac ccgctgaccc cgcgcgcacg ggcgtggcgg 12960 
gagctcaacc ccggtgcggg caacgtgagc aggctctgcc acgagccccg cgtcgccgtc 13020 
gcggtgatcg acgagatgct ggcgccgttc cgcggcagtg gcaggctgac cgtgctgcag 13080 
ccgtaccggc cggtggccgc gcacaacgac ggcgaccgga tcgtgtcggt gaccgttgcg 1314 0 
caccgcgaca ccggtgaaca gatcgagctc tccgcgccgt acatcctgga cgcgacggag 13200 
acgggtgaac tgcttccgtt gtccagcacg gagtacgtca ccggcttcga gtccactctg 13260 
gacaccggcg agccgagtgc gcccgacgtc gcgcagccgg cgaacatgca ggcggtgtcg 13320 
gtgtgcttcg tggtcgacca cgtcgacggc gaccacacca tcgacaaacc ggcgcggtac 13380 
gacttctggc gcgcgtacca gccggacttc tggggcgacc ggatgctgtc gttccgctcc 1344 0 
cccaacccgc gcacgctcgc gatctccgaa cgtacgttca ccccgaaccc ggacgacgac 13500 
ccgctcggcg tcgtgtcgga ccagcggctc agtgccggtg acagcaatct gtggacgttc 13 560 
cggcgcatcg ccgcgcgtcg caacttcgtc gagggtgcct acgacagcga catctgcctg 13620 
gtgaactggc cgatcatcga ctacttcgag tcgccggtga tcgacgtgcc ggacgccgac 13680 
gcgcacatcg ccgcggcgcg ggaactctca cgttcggtgc tctactggct acagaccgag 13740 
gcgccgcgcc cagacggcgg caccggcttc cccggcctcc gcctgcgcgg cgacgtcacc 13800 
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ggcagcgcgg acggtctcgc gcaggcgccg 
gagcacacga tcgtcgaaca ggacctctcg 
cagcacgccg acgccgtggg tgtcggcatg 
ggcgacaact acatcgacgt cgcgagctgc 
ccgcaacggg tggagaacct gctacccgcg 
aacggttccc accggctgca cccagtcgag 
gctgccttct gcctggcgca ccgagtcacc 
ctcgcggact tccagcagtg tctggaacgc 
tccggctact gacgcaggga gacgaaaatg 
atcgtcgcag ggctgttcgc cctcaccggc 
ccgaagtcgc tgcgcatgac cgtgtggact 
gacatcgccg ccgagtacaa ggcctcgcac 
gtgcccgccg acggctacac caccacgctc 
gacctggcct ggatcctgga ggagtcggca 
ccggtgcgcg gcaagatcga gaaggccgac 
gagaaggacg gcgaactgta cgcctacccg 
aacaccgacc tggtgaaggg cgcctcggcg 
gcctctgcgt cggcggccgc ctccggcaag 
cagaactggg cagtgctgtc ctctatctgg 
gacggtcgct cgtgcgggtt ctccagcagc 
aaggccatct tcaccgacaa ggcgattccg 
ggcgacgcgg cgatggcgat cggccagatc 
ttcggctgga cgctgctgcc gctgccggcc 
caggccggga tcggtgtgct gaagcagtcg 
gccttcctca ccaaccagac caactccgcc 
tcgtcgctgc tcaacgcgga gacgctcgcc 
ctgcagtccg tcgtcgtcga cggcatcaac 
caggaggagc tgaaccagac gatccgcgcc 
aacgtgcaga acgtgctgaa cgacgtgtgc 
tgacggcggt cgcacactcc acgcgggccg 
agcgcaggcg cgacaatctg gttggttacc 
tcgcgttcgt gctggtgccg ctggccttgg 
tgctcgccgg cacgttcgag ttcgtcggcg 
agaagctgcg cgactcgctg gtggcgacca 
acctgtcact agcgctgctg ctggccgtgc 
tcttccgcac gttgttcttc tctcccgtcc 
ggcagctgat actgcagccg gagggcagcg 
acgggccgaa ctggctgcgc ggtgagtcga 
tgctcaagaa cgtcggcctg aacatggtgc 
agccgttgta cgaggcggcg aagatcgacg 
tcaccttgcc gttgatcagc ccgacgatcc 
cgctgcaggt gttcgcgcag atcgcggtgc 
ccgtactgat ctactacctg taccagcagg 
cgacgatctc ggtactgctg ttcgtcatcg 
tgcgccggaa gtgggtgctg catgaggctt 
tgcctgctgt gcgtgccttt cgtgttccca 
cccatcagcg agatcttctc cacctcgccg 
cggcaggtgt tcgagatgca gccgttcgcg 



tacatccgcg agtccaggcg catcagggcc 13 860 
ctcgccgtgc gcggcgacaa gggtgcggtg 13920 
taccgtatcg acctgcaccc ctccaccggt 13980 
ccgttcgaga tcccgctcgg cgcgctgatc 1404 0 
ggcaagaaca tcggcaccac ccacatcacc 14100 
tggaacgtcg gcgaggtcgc gggcgcgctc 14160 
cctcgcgcgg tgcgcaatac ccctggcctg 14220 
gacggggtcg agctccgctg gccggacgtg 14280 
acaaagctgt cacgacgact cacggcactc 14340 
tgcggtggtg gatcaaccgc acagtccgga 144 00 
gccaacgcgg cgcatctcaa gctgctcaac 14460 
ccggacatcg ccgagatcaa gttcgactcg 14520 
accacccaga tcgccggcgg taacgcgccg 14580 
ccggacttcg tggcgtccgg tgcgctcgcc 14640 
gagctcgtgc cgtccgcgac gaagctgtgg 14700 
ttctccacct cgccgttcgg cttgttcgtc 14760 
gactggacct gggaccaggc gatcgcggct 1482 0 
ggcggcctgg tactgccgga cttcaagtac 14880 
cgcggctggg gagctgatgc gtggagcgcg 14940 
gagatgaacg acgcgatgtc cttcttgcac 15000 
ggccccggca cgacggtgga cttcttcgcc 15060 
tcccggtcca gtgcgttgaa ggacgcgaag 15120 
ggtccgaagg gtgactacgc ggtgatcggg 15180 
cacaacgtcg acgccgcgac ggacttcctc 15240 
aagctcgcgc agttcttccc cccggcgcgg 15300 
aagagcaacc cgttgatcaa ggccgagcaa 15360 
aagggcgtcg tgaagccgag ccacaagggt 15420 
gcgctcgacc cgttgtggaa gccggacgcg 15480 
accaagatca aaccgctgct ggagaacaag 15540 
tagcccgcaa ggggacgtcg tactggacgc 15600 
tgttcgtcgc gcccgcgctg ctcggcagca 15660 
tcggctggta cagcctcaac gagtggaacg 15720 
cgcagaacta ccaacagctg ctcgccgacg 15780 
cttggttcgc cgccggcctg gtcgtgctca 15840 
tgctcaacca gaagctgagc ggcaccacgg 15900 
tcgtgtcgct ggtggcgtgg acgctggtgt 15960 
tcaacggact gctcgggttc ttcggtgccg 16020 
ccgcgatggt ctcggtgatc gtcgtgcagg 16080 
tgttcctcgc ggcgctgcaa ggc'gtgccgc 16140 
gggcgggtgc ctggacccgg ttccgccgca 16200 
tgctcacgtc gatcatcacc atcgtcggct 16260 
tcacgcaggg cggtccgggc acgtccacga 16320 
cgttccagtt ccaccacttc ggctacggcg 16380 
tcgccgcact caccctgctg cagtggcaga 16440 
agggtcaaga tcacgctata cgggctgctg 16500 
acctggtgga tgatcacctc gtcgatgaag 16560 
ctgccgtcgg agtggacgtt ctccacctac 16620 
cagcagtact ggaacagcct ctacatcgcg 16680 
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gtgatcgtca cgacaggcac aatggcggtg gcggcgatgg ccggatacgc gttcgcgcgt 1674 0 
atccggttcc cgggccagaa cgtgctgttc gtggtcgtcc tgatcggcct gctcatcccg 16800 
agcgaggtca ccatcgtgcc gctgttcaag atgttccagt cgctcggcct gaccaacacg 16860 
cactggccgc taatcatcgt gccgatcttc ggggcgccct gtgtgctggc gatcttcatc 16920 
atgcgccagt tcttcatcgc cctgccgtcg gagctggagg aggccgcgcg gatggacggc 16980 
ctcggacgag cgggcatctt ctaccgggtg gccctgccgc tgtcccggcc cgcgctgggc 17040 
gccgtcgcga tcttcacgtt cctgcactcg tggaacctct acctggagcc gatcgtctat 17100 
ctgtccacac cggacatgta cacgttgccg caagcgctca cgcagttcgt cgacaactac 17160 
ggcggcccga tgtggaacgt gcagctcgcc gccgcgacca tgaccgcagt gccgattctc 17220 
gtcgtgttcg tgctggcgca gcggcagttc atcgagggac tcgcgcacac cggggttaag 17280 
gggtgagtcc cgcgatcgac ccggagatcg ccgcgctggt agacctcgcc agaccaccgg 1734 0 
tgacgccagc gggaatcgac gcggtccggg cgggaggacg cgtcgtcacc gatgccgagc 17400 
tgacgcgcgg tggcaccgtg acgttcgccg atgccgatgc cgatggcgtg cccgtgcttg 174 60 
tgctgcgccc agcgggtgtg ccacgtcttc cggtgctgca cctgcacggc ggcgggatgg 17520 
tcgcgggcac ccggcgcacc gacctgcacg tgctggccga gtgggtgtcg gagctgggtg 175 80 
tcgtgctggt gtcgccggag taccggctcg cacccgagca cccgcacccg gcgccgtcgc 1764 0 
aggactgctt ccgggtgctg gagtggatgt cccgcaacgg cttcggcccg cctgtcgttg 177 00 
cgggtacatc ggcgggcggc gggctggcgg ctgcggtgac gctgatggcc cgcgacctcg 17760 
gcggtccgcc gatcttgccg tcggcgcgca actgctcgaa cgcgtccagc gtctccgcca 1782 0 
acggcgtgct accgcgccag tgcagcaggt agacgtccag gtgatcggtg cccaaccgcc 178 80 
gcagactgcg ttcgcacgcc tcgacggcgc cacgccgacc cgcgttgtgc ggatacacct 17940 
tgctgaccag gaaaacctcg tcgcggcgcc cggcgatcgc ggcaccgacg acctcctcgg 18000 
cgcccccgct gccgtacatc tcggcggtgt cgatcagccc caggccgagg tccaggccgc 18060 
gccgcagcgc ggcgacctcc gccgcgcgct gcccacgccg ctcccccatg ccccaggtgc 18120 
cctgccccag caccggcaat tcctgcccgc cacgcagttg aaggcctcgc atgcacactc 18180 
cgtttcagtc agcgctgctg cggcagcacg gccaccagct cgaagccacc gtcctcggac 1824 0 
ggcccggcgg ccaaccgccc accggccatc* tgcacccgct ccgacaggcc ggccagcccg 183 00 
gacccgccgc tgccgacctc gatggccggt ccccgccggt cgggggcgcc gtttgtgacg 18360 
gcgacgtgca gcgagccctc gcggaacacc agccgcacct tgatgtcagc accgggtgcg 18420 
tgcttggaag cgttcgtcaa cgcctcctgg acgacccggt aagccgcgcg ctcaaccgca 184 80 
ggcgacaccc cgtgcgggtc gcggatctcc gaggaaagct ccacatccac accggcggcg 1854 0 
cgcgcccgca cgaccagctc cgggatcgtg ctcaacccgg cccgctgcct gggggcgccg 18600 
agcccgagcg tcgcgcgcat ctcgtccagc gcctgcttcg ccaacccgcg cagccgcccg 18660 
gcggtctcct tcgcaccggg atcctcggtg gtggccgcca acgccgccga ctcgaccgcg 1872 0 
attaaggtca cgtgatgtcc caccgcgtcg tggatctccc gcgcgatccg ggcccgctcc 18780 
tccgcgcgcg ccgtctcggc cttcgcctgc aactccgcct cggtggccgc ctccagcttc 1884 0 
cgcagactgt ccgccagctg ctcccgcaac gccaccagcg cccccaatgc cgtcggcgcg 18900 
cctgccgcca gcagcacgaa cgccacggtc aggatgatcg acccgacccc cagcgactgc 18960 
atgatcatca ccggtgccac ggcgaccgcc gcggtcagca acacccagcc gaccaacatc 19020 
tggatccgcg gctgcgcgcg ccccaacgcg tacaacgcca cgaccgtcgg cgcccaaccc 19080 
aaaccaccgg ccaacgctgg catgcacagc agcaccgcga tccgcggcca ccgccgcaac 19140 
ggcagcagcg ccgaagccac caaaccggac accgtcgcat agccgaacgg gtcgggcgcg 19200 
gccagcacag ctgccgccgg aatcccgacg accagcgcct caacgccgag ccgagaccag 19260 
cgcaccgaac tcaccggaag ccgcccgcca cgcgctgcgc gatcaacgcg gcctgcaccc 19320 
ggttctgcgc gccgagcttg ctcaacaccg tcgacacgta gctcttgacc gtcgcctcgg 19380 
tcagcccgag ccggaccccg atatcggcgt tggaccgacc atccgccaac aactggagca 19440 
cctgccgctc ccgcccggac aacccctgca ccagctgcgt ctcgtgcgcg ttgtcggaca 19500 
gactgcggaa cctgggcaac aaccgcgcag tgatccgcgg atccagcacc gccccaccgg 19560 



85 



WO 01/16303 



PCT/EP00/08013 



ccgccagatc gtgcaccgcc cgaaccagca ccgcgggctc cgcgtccttg agcaggaacc 19620 
cgtgcgcccc gaaccgcaag gcatcggcca cgtaatcgtc caggtcgaac gtcgtcagca 19680 
ccgccggcac cggcggattc gccaacgtcg ccaagtcccg caacgccgtc agcccgtcct 19740 
tgccgggcat ctgcacgtcg atgagggcga cgtcgacgcg gtacgcgcgc accgcggcca 19800 
gcaactcatc cccgttgccc gcctcggcga ccacccggat cgagccgtcg ccctccaaca 19860 
acacccgcag gccacgccgc agcatcgcct cgtcatcggc aagcacaacg cggatcggcc 19920 
ccccgccgtc gtcagcccgc acctctcggt cacctccctg ttccgagggt cgccagcaga 19980 
ctagcggggc cgcgaaagcg cagcgcacac gccacccgag cgcactgtgt cagcggcgcc 20040 
cgcccttggc cttccgctcc cgcacccgca ccgagatccg caccgggcta cccacgaacc 20100 
cgaacgtctc ccggaacctg cgctccagga accgccggta cccggcctcc aagaacccgg 20160 
tggtgaacag cacgatcgtc ggcggccgcg actgcgcctg ggtggcgaac aggatcttcg 20220 
gctgcttgcc gccccgcacc ggcggcgggt tggccgcgac cagctcggaa agccaaccgt 20280 
tcaaccggcc ggtggagatc cgggtgtccc acgaatccag cgccgtgcgc agcgtcgggg 20340 
ccagtttcgc caccgcgcga ccggtcttcg ccgacacgtt gacccgctcg gcccaccgca 20400 
cccgcaccag ctcgcggtcg atctccttct ccaactggtg gcggcggtcc tcgtcgacca 20460 
ggtcccactt gttgtaggcg atcaccaatg cccgaccggc ctcgacgacc atagtgatca 20520 
cccgcaggtc ctgctcgctc aacggctcgg agccgtcgat gagcacgatc. accacctcgg 20580 
ccgcctcgat cgcggccttg gtgcgcagcg acgcgtagta ctcggtgccg cttgcggtct 20640 
tcacgcgctt gcgcagcccg gcggtgtcga cgaaccgcca cacctggccg tccagctcca 2 0700 
ccagcgagtc caccgggtcc accgtggtgc ccgcgacgtc gtgcaccacc gaccgctctt 2 0760 
ccccggtcag cttgttcagc a'ggctcgact tgcccacgtt cggcttgccg accaatgcca 20820 
cccgccgcgg cccaccggtc gcgccgaaga tctcgcgcgg cgtctccggg aacacctcca 20880 
ggacggcgtc cagcaggtca ccggaaccgc gcccgtgcaa cgcgctgacc ggcatcggct 2094 0 
cgcccaaccc cagcgaccac agcgaatgca cgtcggagac gcctcgctgg tcgtcgacct 21000 
tgttggcggc cagcagcacc ggacgcttgg accgccgcag caccctggcc acggcttcct 21060 
cggtctcggt ggcaccgacc cgggcgtcca ccacgagcag caccgcatcg gcggtgtgca 21120 
tggccagttc cgcctgcgcg gccaccgacg cctgcagtcc cttcgcgtcc ggctcccagc 21180 
caccggtgtc gaccaccgtg aagcggcgcc cgttccacaa cgcgtcgtag gccacccgat 2124 0 
cccgggtgac accaggggtg tcctgcacga cggcttcccg acggcccagc agccggttca 21300 
ccagcgtcga cttgcccacg ttgggacggc cgaccacggc cagcaccggc tgcggcttcg 21360 
tcggctcctc gccctcgacc gcttcgacct cgtcgaactc agcccactca gcctcgtcgg 21420 
accacgtgcc gtccaggcct tccaccgact cgtcggtcac gctcgtttcc ctctcgaata 21480 
gtcttggctc tggtccgcga ccagccgatc cagctcggcg atcagctcga ccagttcggt 21540 
gcgcaccgcc tcggtcgcca cgaccagccc ggcgcggcct ttgcccgccg gcagcacgat 21600 
cggctccccg aacaacacgt cgatcctcgg tagcagccta cggcccgggc catccgggcg 21660 
gcgcgtgccc cggcaggcca ccggcaggat cagcgcctcc gacgtccgcg ccaaccacgc 21720 
cgcaccgtgc tgcgcattgg tcacatcgcc atcaccccgg gtgccctccg ggaacacccc 21780 
gacgagacca ccggcccgca gcacccgcac cgccgccagc agcggggcgc ggtccggcgc 21840 
accgcggcgc accccgatct gcccgatgcg gcgcaggaac cagcccagcg cac'cgcggaa 21900 
catctcctgc ttgatcagga acaccgcgtg gcgcgggatc attccgaaca gcagcggccc 21960 
gtccatcatc gaactgtggt tggccaccag taccaccggg cccgtgcgcg ggatccgctc 2202 0 
cgcgccgtgg atccgcaccc ggtacggcag ctgcacgatc cggcgcgaaa tccactggcc 22080 
gcaccggtgc attccaccgg acgcgccctc gggcagggtc tcgtcgctca tggcgtcgtc 2214 0 
cgctccgccg cgaccagcag gccacggcac tccacgtgct tgcgcaactg cgtgagcacc 22200 
tccaccacac ccaggtccgt ggtgtcgagc tccaccgcgt cgtcggcctt gcgcagcggc 22260 
gccaccttcc gccccgagtc cagcgcatcg cggcgctgca cgtccgcgtg ggtccggtcc 22320 
aggtcaccgg cccgtccctc ggagacgtcc tgcgccgtgc ggcgctgcgc gcgggcgtgc 22380 
gcggacgccg tcaggtagac cttcaaaccc gcatccgggg ccaccacggt gccgatgtcg 2244 0 
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cgtccctcga cgaccatgcc gcctggcgac accagcgcct caccgatcag cttccgctgg 22500 
tgggcgacca actgctcccg cacctcggcg accgccgaca ccgccgacac cgcgccggtc 22560 
acctccggcc cgcggatctc ccgccccacg tccgagccgt ccaggaagat ctcagggccc 22620 
•tccggatcgg tgcccaccgt cagccgcgct gcgcgcacca cgcctgccac tgcggccgga 22680 
tccgacgggt cgacctcggc gcgcagcacc gccagcgtca cggcgcggta catcgctccg 22740 
gtgtcgaggt aggtagcccc caatgcgaac gccaacttgc gggacaccgt cgacttgccg 22800 
gtgcccgaag ggccgtcgag cgccaccaca ccaaggagcc tggcgtgtgc cacgtgcatg 22860 
tccttcccga gccggaaccg atcggcccga cacccggccg gaccaaccgt ccattctgcc 22920 
tggtccgctc ctgccgatcc caatcgcatc cagatccaat ccggactgcg ggacaaatac 22980 
gaaagttcta cgaataaggt cttgggtgtg gatgtaactg tgaccccctt gccagggctc 23040 
ggcacgcagc aggacttcac cacccgatcc ggccaccgca tcggcgtgat cacctaccgg 23100 
gacggccgct tcgagctgat cgtctccgac cacgaagacc cagacaaggt ggccgcctcg 23160 
gtcgccttga ccacctccga gaccagcacg ctggccaacc tgctgggcgc cccgcagctg 23220 
gtcgcccagc tcaacgagca gcacagcgag gtcgccggga tcaccacctg gcagctgtcc 23280 
gtcgccccgg gctcccccta cgccggccga accctcggcg acacggagat gcgcacccgc 23340 
acctccgtgt cgatcgtcgc ggtcgtccgg gacggcaccg tgcacccctc accgcggccg 234 00 
gacttccagt tctccgccgg cgacctggtg gtggtcgtgg gcaccgcgaa ggggctacgc 23460 
gcagccagcg aaatcctgga aaagggctga tccggcccgt cccgaacact ttcccggcca 23520 
cctgaccggg acacgcgtca ccaacggcgg tccagcgcgt gccgatcccc taacaactcc 23580 
atacgcaggg cacacgcgag aggaaacacg ttgcaagaca cggcgatctc cctaatagaa 23640 
ctgggtgcgg ttttcttcgg tctgggcatc ctcgggagac tcgcatggaa aatcggggtg 23700 
tcaccgatcc cgctctacct gatcggcggg ctggccttcg gcaccggtgg cctggtcccc 23760 
ctgcacggca tcgaaccgtt cacccacctc gcctccgaga tcggcgtggt cctgctgttg 23820 
ttgctgttag gcctggaata ctccgcgggt gaactcgtca ccggcttacg ccgctcctgg 23880 
ctggccggtg tcatagacat cgtgctgaac gcggtacccg gtgcgctggt ggccctgctg 23940 
ctgggttggg gtccgctcgg cgcgttcaca atggccggcg tcacctacat ctcttcctcc 24000 
ggcatcatcg cgaaggtgct cggagacctg ggtcggctcg gcaaccggga aacgccggtg 24060 
atcctgtcga tcctggtctt cgaagacctg gcgatggccg tctacctgcc gatcctgacc 24120 
gccgtgctgg ccggggtcag cttcctcggc ggactgaccg cggtcggggt gtcgctggtc 24180 
gtgatcaccc tggtgctggt ggtcgcgctg cgcttcggca agtacgtgtc cgcgctggtc 24240 
gacagccccg acccggaggt cttcctgctg cgcctgctgg gggcggcgct gctcgtcgcg 24300 
gggatcgcct cggagctgca ggtctcggcg gcggtcggct cgttcctgct gggcatcgcg 24360 
atctccgggt cgaccgcggc gaacgcgacc cgcatgctgg agccgctgcg cgacctgttc 24420 
gccgcgctgt tcttcgtcgt gttcggcctc aacaccgatc cgagccagat cccgccggtg 24480 
ctgccggtgg cgctgctgct ggcggtggcg acctgcgcga cgaaagtgct caccggctgg 24540 
ttcgccgccc ggatgcaggg tgtcggccgg atgggtcggc tgcgggccgg tgccgcgctg 24600 
gtggcgcgcg gcgaattctc catcgtgatc gccgggctgg ccgtcgcctc cggcgcggtg 24660 
ccgggcgagc tcgccgccct cgccaccgcc. tacgtgctgc tgatggcgat cgtcggcccg 24720 
gtcgcggccc gcgtcgtcga gccggtggcc cggatgttcc tccccagcca ggc'gaagaag 24780 
gcctgaccgg aaaagcctcc ggctgtcccg agcggacggg ccggaggctt tctcaacgct 24840 
gcggataacg cgggcgggtc tggcaaacgc aacaccggtg cgcgaacagc ctcagaggcc 24900 
cactgcgcgg tagagcgagc cgatctcctg gcggttcaac ggccggatcg cgcccgggcg 24 960 
ctcgttggtc agccgcacat cgccgatccg cgtccgcacc aggcgctgca ccgggtagcc 25020 
gacgtgcttg agcagccgcc gcacgatgcg gttgcggccc tcgtgcaaca cgatctccac 25080 
cagcgaccgg ttctggttca tgtcgaccag cttgaaccgg tccaccttca ccagaccgtc 25140 
ctccagctcc acgccctcgc gcagctgctg cccgaggtcc ttcgggatcg accccagcac 25200 
ctccgccagg tacgtcttgc gcaccttgta cgacgggtgc atcagccggt tcgccagctc 252 60 
gccgtcgttg gtgatcagca gcaggccctc ggtgtcctgg tccagccgcc ccacgtggaa 25320 
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cagcttgccc tcccgttccc gcaggtaatc accgatgcac ggacgcccct tgtcgtccga 253 80 

catggtgcac aggatgccgc gcggcttgtt cagcagcagg tgcgtcagat cgtcgttgac 25440 

catcacccgg ttgccgtcca cgtggaccac .cgcggtcgtc gggtcgaccc ggcgccccag 25500 

ctcggtgacg acctcgccgt cgacctcgat ccgcccctcg acgatcatct cctcggccgc 25560 

gcgccgcgag gccaccccgg cctgggacaa cactttctgc agccgcacgc cttccgaatc 25620 

ggcagtgctg ctgtacgggg aacggtgctc agacgtcatc aatcgaatcc acttcgggca 25680 

acaagggggc gagcggcggc aactccttca acgacgacag ccccagccgc tccaggaaca 25740 

gctcggtcgt gcagtacagg atgccgcccg tctccgggtc ggttccggcc tcttcgataa 25800 

ggccgcgccc caccagggta cggatgacgc cgtcgacgtt cacaccccgt actgccgcga 25860 

cccgcgaacg cgtcaccggc tgccggtacg cgatcaccgc gagggtctcc agcgcggcac 25920 

gcgtcagctt cgaacgctgc ccgtccaaca ggtaccgctc cacgtacggg gcgtacttct 25980 

cccgcgtgta gaaccgccag ccgtcgccga cccgccgcag gtcaacgccc cgttccgcct 26040 

cggcgtagcc gtccgagagc cgccgcagcg cggaccggat ccgggccacc ggctgctcca 26100 

gggtgtcggc cagcagctcc tcgccggccg gcacgtcgac cacgagcagc aacgcctcca 26160 

gcgccgcatc cagcgccgaa tcggaggtca ggtccggcgc actcggcgcc tccaccgcct 26220 

cggaatcccc agcggccgcc gacggctcca acggttcctg cggctcagcg ggcggcgggg 26280 

tcggcgactc ctccgcgacc tcgcccgcgg cctccggctc cagctccgtc acccgtactc 26340 

ctcttcttca tggctggcgc gctgcgcctc ggcttcggcc tccgcctgcg ccacgctgcc 26400 

gcccacccag ctcacgatca gctcacccag cggggtctcc tgctcgaaca gcaggacctt 26460 

ctcccggtag agctcgagca gcgccaggaa ccgcgccacc acctcgacgg tgtgcccgca 26520 

gtccgacacc agctccgaga acgtcgcgct gcccttctcc gccaaccgca cccgcagcag 2 6580 

cgccgcgtgc tcgcgcaccg aaacgccgtg ctggtggatg tggtccagcg aaacggtcgg 26640 

cggtggcttc ggccggaaca ccgcggccgc gatctcggcg aaccgctgcg gcggaacccc 2 6700 

gatcaccact tccggcagca ggttcgcgaa ccgctcctcg accgacaccg aacgcggata 26760 

gcgccgcagc gccccggcct ccaactcgcc gaacagcgcg gccacctgct tgtacgcgcg 26820 

gtactgcagc agccgcgcga agagcaggtc gcgggcctcc agcagcgcca ggtccgcctc 26880 

gtcctcgacc tccgcggccg gcagcagccg ggccgccttc aggtctaaca gcgtcgcggc 26940 

gaccaccagg aactcggtgg tttcgttcag gtcccaccgc tcgccgagcg ccttggtgta 27000 

ggcgatgaac tcgtcggtga ccttgtgcag cgctacttcg gtaacgtcca gctggtgctg 27060 

cgagatcagc tgcagcagca ggtcgaaggg gccctcgaag ttgtccagcc gcaccgtgaa 2712 0 

ccggccgcct gcggcgggat cctcgctcga ctgcgccgcc ggtaccgcgg ggtctgtcac 27180 

tctccgggct cctgcaggcg ccgcaccagc accgagtccg cgccgttctt gtcgaggtcg 2724 0 

gcgagcacca cggcaaccgc ttcgcgcacc acgcggcctc ggtccacggc gatgccgtgg 27300 

tcggcccgca atgccagccg cgcctgctcc aagcccagca actcctcgtc ggacacgtac 27360 

acggtgatct tcgaatcgtg cttgcgccgt ccagagccgc cccggcccgc cgcgggtgcc 27420 

gcctgctccg ggtccggcgg attcgcgtga tccgtccgag gaacagtagt acggaacaat 274 80 

tcggccgctc cgggcagggc gacccgacgc gtcatcgggc gatcacctcg cgcgccagtg 27540 

cgcggtacgc ctgcgccccg gccgagcggg gcgcccagcg ggtgatcggc tcgccggcca 27600 

ccgtggtctc cggaaaccgc accgtgcggt tgatcaccgc atcgaagacg atgtcgccga 27660 

acgcctcgac gacgcgcgcc atcacctcgc gcgagtgcag cgtgcgcggg tcgaacatgg 27720 

tcgccaggat gccgctgatc tccagcctcg ggttcagccg ctcccgcacc ttctcgatgg 27780 

tgtcgatcag cagcgccacc ccgcgcaggc tgaagaactc gcactccagc gggattatca 27840 

cggcgtccgc ggcggccaac gcgttcaccg tcagcagccc cagcgacggc tggcagtcca 27900 

ccagcacgta gtcgtactcc gccagcgccg gctgcagcac ccggtgcagc gtctgctccc 27960 

ggcccacctc cgcgaccagc tgcacctcgg ccgcggacag gtcgatgttg ctgggcagca 28020 

ggtccatgcc ctcgacggtg gtctgccgca ccacgtcatg gatggcgacc ggccgttcca 28080 

tgatcacgtt gtagatggtc tggtcgagct ggtgcggctg gaccccgagg ccgaccgaga 2814 0 

gcgcgccctg cggatcgaaa tccaccagca gcacccggcg cccgtattcc gccagcgctg 28200 
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cgccgaggtt gatcgtcgac gtggtcttcc cgacgccgcc cttctggttg cacatcgcga 28260 
gcaccgatgc gggaccgtgc cggtccagca gcggcggctc ggggatgtgc cgacgtggcc 28320 
ggccagtggg accgaggccg cgggggttcg gcgccaggcc gacctcctcc tcgcggcccg 28380 
ccgtctcccg ggcgatgctc aggtcgaccg cgccccggga ccgcccttcg gcggcgggct 28440 
gcggcagcga catggcgatc tgactccttt tggtgccggt ggcgatcaaa cgcagcctaa 2 85*00 
gtggcattcg ttaccagcgg caacgcgcct cgccgtggtg tcggaagttc cctgtgatct 28560 
gcgtgtgatc aacgtctacc gcgtacacca ccccctcgca ttcgcagcgt caccaagtca 28620 
ctccgtagcg tgtgcgcgcg gatgcgtcgt cgcgtagacc tcccgcagcg tgttcaccgt 28680 
gatcaaggtg tagacctggg tcgtcgtgac cgaggcgtgg cccagcagtt cctgcaccac 28740 
gcggacatcc gcgccgccct ccacaaggtg cgtggcgaac gaatgccgca gcacgtgcgg 28800 
cgaaaccgag ccgttgatcc cggcgcgctg cgcggcggtc ttcaacgcgt tccacgagct 28860 
ctgccgggaa agccgggtac cgcgcgaatt caggaacacc gcggcactac cccgcccgcg 28920 
cgcggccagg ccgggcctgg cccgtaccag gtacgcgtcc agcgccgcca gcgccggtcg 2 8980 
gccgatgggc accagccgct ggcggccgcc cttcccttcc agcagcacgg tccggttggt 29040 
gccgtcgatg tcgtcgaggt ccaacccgac cgcttcggag atccgcgctc ccgtcgaata 29100 
cagcacttcc agcagcgccc ggtccc^cag gccccgggca tcctcgcccc cggcgtgatc 29160 
gagcagcttc ccgacgtcgc cgatggaaag cgccttcggc agccgcttcg gcgggctcgg 29220 
cggcgcgaca tcgcgcgcta cgtcgaccgg cagcagcccc tcggcgtgcg cgaagcggtg 29280 
caggccgcgg gcggccacca aagctcgggc cgacgacgac ggcgccagcg cgggccgctg 29340 
cccggtgcct tcccggagct ccgccaagaa cccggacagg tgctccgaac gcacctcgcc 29400 
gagcccagaa atccctgccg cgaccaggaa ttcggcatac cggcgcaggt ccctggcgta 29460 
ggagtcgaga gtgctgcggg cggtaccgcg ctcgacggca aggtggtcga ggtacccggt 29520 
gatcgccccg cgcaggtccg gcggcagctc ctcgaaaccc ctcaccgggc ggcgaagcgc 29580 
tggggccggt ccggccattc ggcatccgcc gggcgaggct ccacccggcc ctcgcggacc 29640 
gcgtgggcgg cgagcagccc cgccaccccg gggccgttga cgatctcccc ggccagcgcc 2 9700 
atccgcaccg cctcggcgag cgggaagcgg cggatc 2 9736 

<210> 7 
<211> 828 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (825) 

<223> ORF1; O-Methyltransf erase 

<400> 7 

gtg ttg cca ggt ggc gca cca aca teg cag cag gtt ggg cag atg tat 4 8 
Val Leu Pro Gly Gly Ala Pro Thr Ser Gin Gin Val Gly Gin Met Tyr 
1 5 10 15 

gac ctg gtc acg ccg ttg ctg aac teg gtc gcg ggc ggc ccc tgc gec 96 
Asp Leu Val Thr Pro Leu Leu Asn Ser Val Ala Gly Gly Pro Cys Ala 



ate cac cac ggc tac tgg gag aac gac ggg egg get tec tgg cag cag 144 
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He His His Gly Tyr Trp Glu Asn Asp Gly Arg Ala Ser Trp Gin Gin 
35 40 45 

gcc gcc gac egg etc acc gac ctt gtc gee gaa egg ace gtg etc gat 192 
Ala Ala Asp Arg Leu Thr Asp Leu Val Ala Glu Arg Thr Val Leu Asp 
50 55 60 

ggc ggc gtt cga ctg etc gat gtg ggg tgc ggt acc gga caa cca gcg 240 
Gly Gly Val Arg Leu Leu Asp Val Gly Cys Gly Thr Gly Gin Pro Ala 
65 70 75 80 

ctg cgc gtc gcg cgc gac aac gcg ate cag ate acc ggc ate acc gtc 288 
Leu Arg Val Ala Arg Asp Asn Ala He Gin He Thr Gly He Thr Val 
85 90 95 

age cag gtg caa gtg gcc ate gcc get gat tgc gca cgc gaa cgc gga 336 
Ser Gin Val Gin Val Ala He Ala Ala Asp Cys Ala Arg Glu Arg Gly 
100 105 110 

eta age cac egg gtg gac ttc teg tgc gtc gat gcc atg tec ctg ccg 384 
Leu Ser His Arg Val Asp Phe Ser Cys Val Asp Ala Met Ser Leu Pro 
115 120 125 

tac ccg gac aat get ttc gac gcc gcc tgg gcc atg cag . teg ctg ttg 432 
Tyr Pro Asp Asn Ala Phe Asp Ala Ala Trp Ala Met Gin Ser Leu Leu 
130 135 140 

gag atg tec gaa ccg gac cgt gcc ate egg gaa ate ctt cga gta etc 480 
Glu Met Ser Glu Pro Asp Arg Ala He Arg Glu He Leu Arg Val Leu 
145 150 155 160 

aaa ccc ggt ggc ate etc ggc gtc acc gag gtc gtc aaa cga gaa gcg 528 
Lys Pro Gly Gly He Leu Gly Val Thr Glu Val Val Lys Arg Glu Ala 
165 170 175 

ggc ggc ggg atg ccg gtg tec ggg gac agg tgg ccg acc ggc ctt egg 576 
Gly Gly Gly Met Pro Val Ser Gly Asp Arg Trp Pro Thr Gly Leu Arg 
180 185 190 

ate tgc ctg get gag caa ctt ctg gaa teg ctg cgt gca gcg ggg ttc 624 
He Cys Leu Ala Glu Gin Leu Leu Glu Ser Leu Arg Ala Ala Gly Phe 
195 200 205 

gag ate etc gat tgg gag gac_ gtg teg teg agg acc egg tac ttc atg 672 
Glu He Leu Asp Trp Glu Asp Val Ser Ser Arg Thr Arg Tyr Phe Met 
210 215 220 

ccg cag ttc gcc gaa gag etc get gcg cac cag cac ggg ate gcg gac 72 0 
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Pro Gin Phe Ala Glu Glu Leu Ala Ala His Gin His Gly He Ala Asp 
225 230 235 240 

agg tac ggg ccg get gtc gec ggc tgg gec gec gcg gtc tgc gat tat 768 
Arg Tyr Gly Pro Ala Val Ala Gly Trp Ala Ala Ala Val Cys Asp Tyr 
245 250 255 

gag aaa tat gec cac gac atg ggc. tat gcg att ctg acg gcg egg aag 816 
Glu Lys Tyr Ala His Asp Met Gly Tyr Ala He Leu Thr Ala Arg Lys 
260 265 270 

ccg gtc ggc tga 828 
Pro Val Gly 
275 



<210> 8 
<211> 275 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 8 

Val Leu Pro Gly Gly Ala Pro Thr Ser Gin Gin Val Gly Gin Met Tyr 
15 10 15 

Asp Leu Val Thr Pro Leu Leu Asn Ser Val Ala Gly Gly Pro Cys Ala 
20 25 30 ( 

He His His Gly Tyr Trp Glu Asn Asp Gly Arg Ala Ser Trp Gin Gin 
35 40 45 

Ala Ala Asp Arg Leu Thr Asp Leu Val Ala Glu Arg Thr Val Leu Asp 
50 55 60 

Gly Gly Val Arg Leu Leu Asp Val Gly Cys Gly Thr Gly Gin Pro Ala 
65 70 75 80 

Leu Arg Val Ala Arg Asp Asn Ala He Gin He Thr Gly He Thr Val 
65 90 95 

Ser Gin Val Gin Val Ala He Ala Ala Asp Cys Ala Arg Glu Arg Gly 
100 105 110 

Leu Ser His Arg Val Asp Phe Ser Cys Val Asp Ala Met Ser Leu Pro 
115 120 125 

Tyr Pro Asp Asn Ala Phe Asp Ala Ala Trp Ala Met Gin Ser Leu Leu 
130 135 140 
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Glu Met Ser Glu Pro Asp Arg Ala He Arg Glu He Leu Arg Val Leu 

145 150 155 160 

Lys Pro Gly Gly lie Leu Gly Val Thr Glu Val Val Lys Arg Glu Ala 

165 170 175 



Gly Gly Gly Met Pro Val Ser Gly Asp Arg Trp Pro Thr Gly Leu Arg 
180 185 190 

He Cys Leu Ala Glu Gin Leu Leu Glu Ser Leu Arg Ala Ala Gly Phe 
195 200 205 

Glu He Leu Asp Trp Glu Asp Val Ser Ser Arg Thr Arg Tyr Phe Met 
210 215 220 



Pro Gin Phe Ala Glu Glu Leu Ala Ala His Gin His Gly He Ala Asp 
225 230 235 240 

Arg Tyr Gly Pro Ala Val Ala Gly Trp Ala Ala Ala Val Cys Asp Tyr 
245 250 255 

Glu Lys Tyr Ala His Asp Met Gly Tyr Ala He Leu Thr Ala Arg Lys 
260 265 270 



Pro Val Gly 
275 



<210> 9 
<211> 1173 
<212> DNA 

<213> Saccharopolyspora spinosa . 

<220> 

<221> CDS 

<222> (1) . . (1170) 

<223> ORF2; Glycosyltransf erase 

<400> 9 

atg cgc gta etc gtc gtt ccc ttg ccc tat ccg acg cat etc atg gca 4 8 
Met Arg Val Leu Val Val Pro Leu Pro Tyr Pro Thr His Leu Met Ala 
1 5 10 15 

atg gtg ccg ctg tgc tgg gcg ctg caa gca tec ggg cac gag gtc ctg 96 
Met Val Pro Leu Cys Trp Ala Leu Gin Ala Ser Gly His Glu Val Leu 
20 25 30 
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ate gec gca cca cca gag ctg cag gcg acc gcg cat ggt gca ggt etc 144 
lie Ala Ala Pro Pro Glu Leu Gin Ala Thr Ala His Gly Ala Gly Leu 
35 40 45 

acc acg gee ggg ate cgc ggg aac gac agg acc ggc gat acg ggt gga 192 
Thr Thr Ala Gly He Arg Gly Asn Asp Arg Thr Gly Asp Thr Gly Gly 
50 55 60 

acc acg cag ctg cgc ttt ccc aat ccg gcg ttc ggt cag cgc gac acc 24 0 
Thr Thr Gin Leu Arg Phe Pro Asn Pro Ala Phe Gly Gin Arg Asp Thr 
65 70 75 80 

gag gca ggc egg caa ctg tgg gag cag acc gcg tec aat gtc gcg caa 288 
Glu Ala Gly Arg Gin Leu Trp Glu Gin Thr Ala Ser Asn Val Ala Gin 
85 90 95 

age teg etc gat cag etc ccc gaa tac ctt cga ctg gec gag gee tgg 336 
Ser Ser Leu Asp Gin Leu Pro Glu Tyr Leu Arg Leu Ala Glu Ala Trp 
100 105 110 

cga ccg tea gtg ctg ttg gtc gac gtc tgc gcg ctg ate ggc egg gtg 384 
Arg Pro Ser Val Leu Leu Val Asp Val Cys Ala Leu He Gly Arg Val 
115 120 125 

etc ggc gga ttg etc gac ctg ccg gtc gtg ctg cac cgc tgg gga gtc 432 
Leu Gly Gly Leu Leu Asp Leu Pro Val Val Leu His Arg Trp Gly Val 
130 135 140 

gac ccc acc gca ggc ccc ttc age gat cga gee cac gag ttg ctt gac 480 
Asp Pro Thr Ala Gly Pro Phe Ser Asp Arg Ala His Glu Leu Leu Asp 
145 150 155 160 

ccg gtg tgc egg cac cac gga ctg acc ggc ctg ccc act ccc gag etc 528 
Pro Val Cys Arg His His Gly Leu Thr Gly Leu Pro Thr Pro Glu Leu 
165 170 175 

ate etc gat ccc tgt ccg ccg age ctg caa gca age gac gcg ccg caa 576 
He Leu Asp Pro Cys Pro Pro Ser Leu Gin Ala Ser Asp Ala Pro Gin 
180 185 190 

ggc gca ccg gtc cag tac gtg ccg tac aac gga age ggc gca ttc ccg 624 
Gly Ala Pro Val Gin Tyr Val Pro Tyr Asn Gly Ser Gly Ala Phe Pro 
195 200 205 

gca tgg ggc gcg gcg cgc acc tea gca egg egg gtc tgc ate tgc atg 672 
Ala Trp Gly Ala Ala Arg Thr Ser Ala Arg Arg Val Cys He Cys Met 
210 215 220 
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ggc cgc atg gtg ctg aac gcc acc ggg ccg get ccg ctg ctg cgc gca 720 
Gly Arg Met Val Leu Asn Ala Thr Gly Pro Ala Pro Leu Leu Arg Ala 
225 230 235 240 

gta gcg get gcc acc gag ttg ccc ggc gtc gag gcc gtg ate gcc gtt 768 
Val Ala Ala Ala Thr Glu Leu Pro Gly Val Glu Ala Val He Ala Val 
245 250 255 

ccc ccc gag cac egg gca ctt etc acc gac eta ccc gac aac gcc egg 816 
Pro Pro Glu His Arg Ala Leu Leu Thr Asp Leu Pro Asp Asn Ala Arg 
260 265 270 

ate gcc gaa teg gtc ccg etc aac ctg ttc ctg cgt acc tgc gag ctg 864 
He Ala Glu Ser Val Pro Leu Asn Leu Phe Leu Arg Thr Cys Glu Leu 
275 280 .285 

gtc ate tgc gcg ggc ggc teg gga acg gca ttc acc gcg acc cga etc 912 
Val He Cys Ala Gly Gly Ser Gly Thr Ala Phe Thr Ala Thr Arg Leu 
290 295 300 

ggc ate ecg caa etc gtg ctt ccc cag tac ttc gac cag ttc gac tac 960 
Gly He Pro Gin Leu Val Leu Pro Gin Tyr Phe Asp Gin Phe Asp Tyr 
305 310 315 320 

gcg cgc aac etc gcc get gcc ggg gcg ggc ate tgc ttg ccg gat gag 1008 
Ala Arg Asn Leu Ala Ala Ala Gly Ala Gly He Cys Leu Pro Asp Glu 
325 330 335 

cag gcc cag tec gac cac gaa cag ttc acc gac tea ate gca acg gtg 1056 
Gin Ala Gin Ser Asp His Glu Gin Phe Thr Asp Ser He Ala Thr Val 
340 345 350 

etc ggc gac acc ggc ttc get get gcg gca ate aaa etc age gac gag 1104 
Leu Gly Asp Thr Gly Phe Ala Ala Ala Ala He Lys Leu Ser Asp Glu 
355 360 365 

ate acg gcc atg ccc cat ccc gcc gcg ctg gtg egg acg ctg gag aac 1152 
He Thr Ala Met Pro His Pro Ala Ala Leu Val Arg Thr Leu GlU Asn 
370 375 380 

act gcg gcc ate cgt gcc tga 1173 
Thr Ala Ala He Arg Ala 
385 390 



<210> 10 
<211> 390 
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<213> Saccharopolyspora spinosa 
<400> 10 

Met Arg Val Leu Val Val Pro Leu Pro Tyr Pro Thr His Leu Met Ala 
15 10 15 

Met Val Pro Leu Cys Trp Ala Leu Gin Ala Ser Gly His Glu Val Leu 
20 25 30 

lie Ala Ala Pro Pro Glu Leu Gin Ala Thr Ala His Gly Ala Gly Leu 
35 40 45 

Thr Thr Ala Gly lie Arg Gly Asn Asp Arg Thr Gly Asp Thr Gly Gly 
50 55 60 

Thr Thr Gin Leu Arg Phe Pro Asn Pro Ala Phe Gly Gin Arg Asp Thr 
65 70 75 80 

Glu Ala Gly Arg Gin Leu Trp Glu Gin Thr Ala Ser Asn Val Ala Gin 
85 90 95 

Ser Ser Leu Asp Gin Leu Pro Glu Tyr Leu Arg Leu Ala Glu Ala Trp 
100 105 110 

Arg Pro Ser Val Leu Leu Val Asp Val Cys Ala Leu He Gly Arg Val 
115 120 125 

Leu Gly Gly Leu Leu Asp Leu Pro Val Val Leu His Arg Trp Gly Val 
130 135 140 

I Asp Pro Thr Ala Gly Pro Phe Ser Asp Arg Ala His Glu Leu Leu Asp 

145 150 155 160 

Pro Val Cys Arg His His Gly Leu Thr Gly Leu Pro Thr Pro Glu Leu 
165 170 175 

He Leu Asp Pro Cys Pro Pro Ser Leu Gin Ala Ser Asp Ala Pro Gin 
180 185 190 

Gly Ala Pro Val Gin Tyr Val Pro Tyr Asn Gly Ser Gly Ala Phe Pro 
195 200 205 

Ala Trp Gly Ala Ala Arg Thr Ser Ala Arg Arg Val Cys He Cys Met 
210 215 220 

Gly Arg Met Val Leu Asn Ala Thr Gly- Pro Ala Pro Leu Leu Arg Ala 
225 230 235 240 
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Val Ala Ala Ala Thr Glu Leu Pro Gly Val Glu Ala Val He Ala Val 
245 250 255 

Pro Pro Glu His Arg Ala Leu Leu Thr Asp Leu Pro Asp Asn Ala Arg 
260 265 270 

He Ala Glu Ser Val Pro Leu Asn Leu . Phe Leu Arg Thr Cys Glu Leu 
275 280 285 

Val He Cys Ala Gly Gly Ser Gly Thr Ala Phe Thr Ala Thr Arg Leu 
290 295 300 

Gly He Pro Gin Leu Val Leu Pro Gin Tyr Phe Asp Gin Phe Asp Tyr 
305 310 315 320 

Ala Arg Asn Leu Ala Ala Ala Gly Ala Gly He Cys Leu Pro Asp Glu 
325 330 335 

Gin Ala Gin Ser Asp His Glu Gin Phe Thr Asp Ser He Ala Thr Val 
340 345 350 

Leu Gly Asp Thr Gly Phe Ala Ala. Ala Ala He Lys Leu Ser Asp. Glu 
355 360 365 

He Thr Ala Met Pro His Pro Ala Ala Leu Val Arg Thr Leu Glu Asn 
370 375 380 

Thr Ala Ala He Arg Ala 
385 390 



<210> 11 
<211> 753 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (750) 

<223> 0RF3; O-Methyltransf erase 
<400> 11 

atg ccc tec cag aac gcg ctg tac ctg gac ctg etc aag aag gta etc 48 
Met Pro Ser Gin Asn Ala Leu Tyr Leu Asp Leu Leu Lys Lys Val Leu 
15 10 15 
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acc aac acg att tac agt gat egg ccg cat ccg aac gec tgg cag gac 96 
Thr Asn Thr He Tyr Ser Asp Arg Pro His Pro Asn Ala Trp Gin Asp 
20 25 30 

aac acc gac tac agg cag gec get egg gee aaa ggc acg gac tgg cca 144 
Asn Thr Asp Tyr Arg Gin Ala Ala Arg Ala Lys Gly Thr Asp Trp Pro 
35 40 45 

act gtc gcg cac acg atg ate ggt jctg gag egg ctg gac aac etc cag 192 
Thr. Val Ala His Thr Met He Gly Leu Glu Arg Leu Asp Asn Leu Gin 
50 55 60 

cac tgc gtg gaa gec gtg etc gca gac ggt gtt ccc ggg gat ttc gee 240 
His Cys Val Glu Ala Val Leu Ala Asp Gly Val Pro Gly Asp Phe Ala 
65 70 75 80 

gag acc ggt gtc tgg egg ggc ggc gca tgc ate ttc atg cgc gcg gtt 288 
Glu Thr Gly Val Trp Arg. Gly Gly Ala Cys lie Phe Met Arg Ala Val 
85 90 95 

etc cag gca ttc gga gat acc gga cgt acc gtc tgg gta gtg gat tec 33 6 
Leu Gin Ala Phe Gly Asp Thr Gly Arg Thr Val Trp Val Val Asp Ser 
100 105 110 

ttc cag gga atg ccg gaa age tct gcg caa gac cac caa gcg gac cag 3 84 
Phe Gin Gly Met Pro Glu Ser Ser Ala Gin Asp His Gin Ala Asp Gin 
115 120 125 

get atg gcg ctg cac gag tac aac gac gtg ctt ggc gta teg ctt gag 432 
Ala Met Ala Leu His Glu Tyr Asn Asp Val Leu Gly Val Ser Leu Glu 
130 135 140 

acc gtc egg cag aac ttc gee cgc tac ggg ctg etc gac gaa cag gtc 4 80 
Thr Val Arg Gin Asn Phe Ala Arg Tyr Gly Leu Leu Asp Glu Gin Val 
145 150 155 160 

agg ttc etc ccc ggc tgg ttc egg gac acc ttg ccc acc gee ccc ate 52 8 
Arg Phe Leu Pro Gly Trp Phe Arg Asp Thr Leu Pro Thr Ala Pro He 
165 170 175 

•cag gaa etc gee gtg eta cga etc gac ggc gac etc tac gaa tec aca 576 
Gin Glu Leu Ala Val Leu Arg Leu Asp Gly Asp Leu Tyr Glu Ser Thr 
180 -185 190 

atg gac tea ttg egg aac ctg tac ccg aag etc teg ccg ggc gga ttc 624 
Met Asp Ser Leu Arg Asn Leu Tyr Pro Lys Leu Ser Pro Gly Gly Phe 
195 200 205 
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gtc ate ate gac gac tat ttt ctg ccg tec tgc cag gac gcg gtg aag 672 
Val He He Asp Asp Tyr Phe Leu Pro Ser Cys Gin Asp Ala Val Lys 
210 215 220 

ggg ttc cgc gcg gaa etc ggg ate acg gaa ecc ate cac gac ate gac 720 
Gly Phe Arg Ala Glu Leu Gly He Thr Glu Pro lie His Asp He Asp 
225 230 235 240 

ggc acg ggc gee tac tgg cgc cgc age tgg tga 753 
Gly . Thr Gly Ala Tyr Trp Arg Arg Ser Trp 
245 250 



<210> 12 
<211> 250 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 12 

Met Pro Ser Gin Asn Ala Leu Tyr Leu Asp Leu Leu Lys Lys Val Leu 
15 10 15 

Thr Asn Thr He Tyr Ser Asp Arg Pro His Pro Asn Ala Trp Gin Asp 
20 25 30 

Asn Thr Asp Tyr Arg Gin Ala Ala Arg Ala Lys Gly Thr Asp Trp Pro 
35 40 45 

Thr Val Ala His Thr Met He Gly Leu Glu Arg Leu Asp Asn Leu Gin 
50 . 55 60 

His Cys Val Glu Ala Val Leu Ala Asp Gly Val Pro Gly Asp Phe Ala 
65 70 75 80 

Glu Thr Gly Val Trp Arg Gly Gly Ala Cys He Phe Met Arg Ala Val 
85 90 95 

Leu Gin Ala Phe Gly Asp Thr Gly Arg Thr Val Trp Val Val Asp Ser 
100 105 110 

Phe Gin Gly Met Pro Glu Ser Ser Ala Gin Asp His Gin Ala Asp Gin 
115 120 125 

Ala Met Ala Leu His Glu Tyr Asn Asp Val Leu Gly Val Ser Leu Glu 
130 135 140 

Thr Val Arg Gin Asn Phe Ala Arg Tyr Gly Leu Leu Asp Glu Gin Val 
145 150 155 160 
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Arg Phe Leu Pro Gly Trp Phe Arg Asp Thr Leu Pro Thr Ala Pro He 
165 170 175 

Gin Glu Leu Ala Val Leu Arg Leu Asp Gly Asp Leu Tyr Glu Ser Thr 
180 185 190 

Met Asp Ser Leu Arg Asn Leu Tyr Pro Lys Leu Ser Pro Gly Gly Phe 
195 200 205 

Val He lie Asp Asp Tyr Phe Leu Pro Ser Cys Gin Asp Ala Val Lys 
210 215 220 

Gly Phe Arg Ala Glu Leu Gly He Thr Glu Pro lie His Asp lie Asp 
225 230 235 240 

Gly Thr Gly Ala Tyr Trp Arg Arg Ser Trp 
245 250 



<210> 13 
<211> 1188 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1185) 

<223> ORF4; O-Methyltransf erase 

<400> 13 

atg agt gag ate gca gtt gec ccc tgg teg gtg gtg gag cgt ttg ctg 4 8 

Met Ser Glu lie Ala Val Ala Pro Trp Ser Val Val Glu Arg Leu Leu 
1 5 10 15 

etc gcg gcg ggt gcg ggc ccg gcg aag etc cag gaa gca gtg cag gtg 96 
Leu Ala Ala Gly Ala Gly Pro Ala Lys Leu Gin Glu Ala Val Gin Val 
20 25 30 

gec gga ctg gac gcg gtg gec gac gec ate gtc gac gaa etc gtc gta 144 
Ala Gly Leu Asp Ala Val Ala Asp Ala lie Val Asp Glu Leu Val Val 
35 40 45 

cgc tgc gat ccg ctg teg ttg gac gag teg gtg cga ate ggc ctg gag 192 
Arg Cys Asp Pro Leu Ser Leu Asp Glu Ser Val Arg lie Gly Leu Glu 
50 55 60 
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ate act tct ggc get cag ctg gtc egg aga acegtt gag etc gat cac 240 
He Thr Ser Gly Ala Gin Leu Val Arg Arg Thr Val Glu Leu Asp His 
65 70 75 80 

gca ggc ctg egg etc gcg gcg gtc gee gaa gca get get gtt etc egg 288 
Ala Gly Leu Arg Leu Ala Ala Val Ala Glu Ala Ala Ala Val Leu Arg 
85 90 95 

ttc gac gcg gtg gat ctg ctg gaa ggg etc ttc ggc ccg gtt gac ggc 336 
Phe Asp Ala Val Asp Leu Leu Glu Gly Leu Phe Gly Pro Val Asp Gly 
100 105 110 

agg egg cac aac age cgt gaa gtc cgc tgg teg gac age atg acg cag 384 
Arg Arg His Asn Ser Arg Glu Val Arg Trp Ser Asp Ser Met Thr Gin 
115 120 125 

ttc teg ccc gac cag ggc etc gee ggc gcg cag cgc ctg ctg gcg ttc 432 
Phe Ser Pro Asp Gin Gly Leu Ala Gly Ala Gin Arg Leu Leu Ala Phe 
130 135 140 

egg aac agg gtg tec ace gcg gtg cac gee gtg ctg gec gca gec gee 4 80 
Arg Asn Arg Val Ser Thr Ala Val His Ala Val Leu Ala Ala Ala Ala 
145 150 155 160 

ace agg cgc gcg gac etc ggt gcg ctg gca gtc cgc tac gga tec gac 528 
Thr Arg Arg Ala Asp Leu Gly Ala Leu Ala Val Arg Tyr Gly Ser Asp 
165 170 175 

aaa tgg gcg gac ctg cac tgg tac ace gaa cac tac gag cac cac ttc 576 
Lys Trp Ala Asp Leu His Trp Tyr Thr Glu His Tyr Glu His His Phe 
180 185 190 

tec cga ttc cag gat gee ccg gtg cga gtg ttg gaa ata gga ate ggt 624 
Ser Arg Phe Gin Asp Ala Pro Val Arg Val Leu Glu He Gly He Gly 
195 200 205 

ggt tat cac gca ccc gaa etc ggt ggt get teg ctg cgc atg tgg cag 672 
Gly Tyr His Ala Pro Glu Leu Gly Gly Ala Ser Leu Arg Met Trp Gin 
210 215 220 

egg tac ttc egg cga ggt etc gtt tac ggg ctg gac att ttc gag aaa 720 
Arg Tyr Phe Arg Arg Gly Leu Val Tyr Gly Leu Asp He Phe Glu Lys 
225 230 235 240 

gec ggg aac gaa ggg cac cga gtg cga aag ctg cga ggt gac cag age 768 
Ala Gly Asn Glu Gly His Arg Val Arg Lys Leu Arg Gly Asp Gin Ser 
245 250 255 
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gat gcg gaa ttc ctg gaa gac atg gcg ggg aag ate ggg ccg ttc gac 816 
Asp Ala Glu Phe Leu Glu Asp Met Ala Gly Lys lie Gly Pro Phe Asp 
260 265 270 

att gtc ate gac gac ggc age cat gtc aac gac cac gtc aag aaa tec 864 
He Val He Asp Asp Gly Ser His Val Asn Asp His Val Lys Lys Ser 
275 280 285 

ttc caa tec ctg ttt ccg cac gtc cgc cca ggt ggt ttg tac gtc ate 912 
Phe Gin Ser Leu Phe Pro His Val Arg Pro Gly Gly Leu Tyr Val He 
290 . 295 300 

gag gat etc cag acg gcg tac tgg ccc ggc tac ggc ggt cgc gat ggg 960 
Glu Asp Leu Gin Thr Ala Tyr Trp Pro Gly Tyr Gly Gly Arg Asp Gly 
305 310 315 . 320 

gaa ccc gcg gee cag cgc ace teg ate gac atg etc aaa gaa ctg ate 1008 
Glu Pro Ala Ala Gin Arg Thr Ser He Asp Met Leu Lys Glu Leu He 
325 330 335 

gac ggc ctg cat tat cag gag cgc gaa teg egg tgc ggg acc gag ccc 1056 
Asp Gly Leu His Tyr Gin Glu Arg Glu Ser Arg Cys Gly Thr Glu Pro 
340 345 350 

tec tac acg gaa egg aac gtg gcg gee ctg cac ttc tac cac aac ctg 1104 
Ser Tyr Thr Glu Arg Asn Val Ala Ala Leu His Phe Tyr His Asn Leu 
355 360 365 

gta ttc gtg gag aaa ggg etc aac get gag cct gee gcg ccg ggg ttc 1152 
Val Phe Val Glu Lys Gly Leu Asn Ala Glu Pro Ala Ala Pro Gly Phe 
370 375 380 

gtg ccc egg caa gcg etc ggc gtc gag gac ggc tga 1188 
Val Pro Arg Gin Ala Leu Gly Val Glu Asp Gly 
385 390 395 



<210> 14 
<211> 395 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 14 

Met Ser Glu He Ala Val Ala Pro Trp Ser Val Val Glu Arg Leu Leu 
15 10 15 

Leu Ala Ala Gly Ala Gly Pro Ala Lys Leu Gin Glu Ala Val Gin Val 
20 25 30 
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Ala Gly Leu Asp Ala Val Ala Asp Ala He Val Asp Glu Leu Val Val 
35 40 45 

Arg Cys Asp Pro Leu Ser Leu Asp Glu Ser Val Arg He Gly Leu Glu 
50 55 60 

He Thr Ser Gly Ala Gin Leu Val Arg Arg Thr Val Glu Leu Asp His 
65 70 75 80 



Ala Gly Leu Arg Leu Ala Ala Val Ala Glu Ala Ala Ala Val Leu Arg 
85 90 95 

Phe Asp Ala Val Asp Leu Leu Glu Gly Leu Phe Gly Pro Val Asp Gly 
100 105 110 

Arg Arg His Asn Ser Arg Glu Val Arg Trp Ser Asp Ser Met Thr Gin 
115 120 125 



Phe Ser Pro Asp Gin Gly Leu Ala Gly Ala Gin Arg Leu Leu Ala Phe 
130 135 140 

Arg Asn Arg Val Ser Thr Ala Val His Ala Val Leu Ala Ala Ala Ala 
145 150 155 160 

Thr Arg Arg Ala Asp Leu Gly Ala Leu Ala Val Arg Tyr Gly Ser Asp 
165 170 175 

Lys Trp Ala Asp Leu His Trp Tyr Thr Glu His Tyr Glu His His Phe 
180 185 190 

Ser Arg Phe Gin Asp Ala Pro Val Arg Val Leu Glu He Gly He Gly 
195 200 205 

Gly Tyr His Ala Pro Glu Leu Gly Gly Ala Ser Leu Arg Met Trp Gin 
210 215 220 



Arg Tyr Phe Arg Arg Gly Leu Val 
225 230 

Ala Gly Asn Glu Gly His Arg Val 
245 

Asp Ala Glu Phe Leu Glu Asp Met 
260 

He Val He Asp Asp Gly Ser His 
275 280 



Tyr Gly Leu Asp He Phe Glu Lys 
235 240 

Arg Lys Leu Arg Gly Asp Gin Ser 
250 255 

Ala Gly Lys He Gly Pro Phe Asp 
265 270 

Val Asn Asp His Val Lys Lys Ser 
285 
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Phe Gin Ser Leu Phe Pro His Val Arg Pro Gly Gly Leu Tyr Val He 
290 295 300 

Glu Asp Leu Gin Thr Ala Tyr Trp Pro Gly Tyr Gly Gly Arg Asp Gly 
305 310 315 320 

Glu Pro Ala Ala Gin Arg Thr Ser He Asp Met Leu Lys Glu Leu He 
325 330 335 

Asp Gly Leu His Tyr Gin Glu Arg Glu Ser Arg Cys Gly Thr Glu Pro 
340 345 350 

Ser Tyr Thr Glu Arg Asn Val Ala Ala Leu His Phe Tyr His Asn Leu 
355 360 365 

Val Phe Val Glu Lys Gly Leu Asn Ala Glu Pro Ala Ala Pro Gly Phe 
370 375 380 

Val Pro Arg Gin Ala Leu Gly Val ' Glu Asp Gly 
385 390 395 



<210> 15 
<211> 1620 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1617) 

<223> 0RF5; C-C verknupf endes Enzym, Cyclisierungsenzym 
<400> 15 

atg ate teg get gcg ggc gaa caa agt gga cca gtc aga aaa gga ggg 48 
Met He Ser Ala Ala Gly Glu Gin Ser Gly Pro Val Arg Lys Gly Gly 
1 5 10 15 

gcg gtg ccc gaa ttc cat gac ccg gca ccc atg aat cgt cga acc cca 96 
Ala Val Pro Glu Phe His Asp Pro Ala Pro Met Asn Arg Arg Thr Pro 
20 25 30 

gga aca gag ate acc gtc gag ccc gac gat cct cgt tat ccg gac etc 144 
Gly Thr Glu He Thr Val Glu Pro Asp Asp Pro Arg Tyr Pro Asp Leu 
35 40 45 



gtc gtc ggg cac aac ccc cgt ttc acc gga aaa ccc gaa cgc ate cac 192 
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Val Val Gly His Asn Pro Arg Phe Thr Gly Lys Pro Glu Arg He His 
50 55 60 

ate gec age tec gee gaa gac gtc gtg cac gec gtc gee gac gec gtg 240 
He Ala Ser Ser Ala Glu Asp Val Val His Ala Val Ala Asp Ala Val 
65 70 75 80 

cgc ace ggc agg egg gta ggg gtc cgc age ggc ggg cac tgc ttc gag 2 88 
Arg Thr Gly Arg Arg Val Gly Val Arg Ser Gly Gly His Cys Phe Glu 
85 90 95 

aat etc gtt gcg gac ccg gcg ate cga gtg etc gtc gac etc tec gag 336 
Asn Leu Val Ala Asp Pro Ala He Arg Val Leu Val Asp Leu Ser Glu 
100 105 110 

etc aac cgc gtg tac tac gac age acg cgc ggg gca ttc gcg ate gag 384 
Leu Asn Arg Val Tyr Tyr Asp Ser Thr Arg Gly Ala Phe Ala He Glu 
115 120 125 

gcg ggc gec gee etc ggg cag gtg tac cga ace ctg ttc aag aac tgg 432 
Ala Gly Ala Ala Leu Gly Gin Val Tyr Arg Thr Leu Phe Lys Asn Trp 
130 135 140 . 

ggc gtg acg ate ccg ace ggc gca tgt ccc ggg gtg ggc gca ggc ggg 480 
Gly Val Thr He Pro Thr Gly Ala Cys Pro Gly Val Gly Ala Gly Gly 
145 150- 155 160 

cac ate etc ggc ggg gga tac ggc ccg ctg teg cgc cga ttc ggt teg 528 
His He Leu Gly Gly Gly Tyr Gly Pro Leu Ser Arg Arg Phe Gly Ser 
165 170 ' 175 

gtc gtc gac tac ctt caa ggc gtc gag gtc gtc gtg gtc gac cag gee 576 
Val Val Asp Tyr Leu Gin Gly Val Glu Val Val Val Val Asp Gin Ala 
180 185 190 

ggt gaa gtg cac ate gtc gag gee gac egg aac tec acg ggc gec ggt 624 
Gly Glu Val His He Val Glu Ala Asp Arg Asn Ser Thr Gly Ala Gly 
195 200 205 

cac gac ttg tgg tgg gcg cac ace ggt ggc ggt ggc ggc aac ttc ggg 672 
His Asp Leu Trp Trp Ala His Thr Gly Gly Gly Gly Gly Asn Phe Gly 
210 215 220 

ate gtc ace agg ttt tgg etc cga acg ccg gac gtg gtc age acc gac 720 
He Val Thr Arg Phe Trp Leu Arg Thr Pro Asp Val Val Ser Thr Asp 
225 230 235 240 

gee gca gag etc ctg cca egg ccg ccc gcg aca gtg ctg etc cga teg 768 
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Ala Ala Glu Leu Leu Pro Arg Pro Pro Ala Thr Val Leu Leu Arg Ser 
245 250 255 

ttc cac tgg ccg tgg cac gaa ctg aca gag cag tea ttc gec gtg etc 816 
Phe His Trp Pro Trp His *Glu Leu Thr Glu Gin Ser Phe Ala Val Leu 
260 265 270 

eta cag aac ttc ggc aat tgg tac gag cag cac age gcg cct gaa tec 864 
Leu Gin Asn Phe Gly Asn Trp Tyr Glu Gin His Ser Ala Pro Glu Ser 
275 280 285 

acg caa etc ggg ttg ttc age acg etc gtc tgc gca cac egg caa get 912 
Thr Gin Leu Gly Leu Phe Ser Thr Leu Val Cys Ala His Arg Gin Ala 
290 295 300 

ggc tac gtc acg ctg aac gtt cac ctg gac ggc acg gat ccg aac gcg 960 
Gly Tyr Val Thr Leu Asn Val His Leu Asp Gly Thr Asp Pro Asn Ala 
305 310 315 320 

gaa cgc ace ctg gec gaa cac ctg teg gcg ate aac gee cag gtc ggc 1008 
Glu Arg Thr Leu Ala Glu His Leu Ser Ala He Asn Ala Gin Val Gly 
325 330 335 

gtg act cca gec gaa ggg ctg egg gaa ace ctg ccg tgg ttg cga teg 1056 
Val Thr Pro Ala Glu Gly Leu Arg Glu Thr Leu Pro Trp Leu Arg Ser 
340 345 350 

ace cag gtg gee ggg gcg ate gee gaa ggc ggc gaa ccg ggc atg caa 1104 
Thr Gin Val Ala Gly Ala He Ala Glu Gly Gly Glu Pro Gly Met Gin 
355 360 365 

egg acc aag gtc aaa gee gec tac ttg cgc ace ggg ctg tec gaa get 1152 
Arg Thr Lys Val Lys Ala Ala Tyr Leu Arg Thr Gly Leu Ser Glu Ala 
370 375 380 

caa eta gec acg gtt tac egg egg ctg acc gtc tac gga tac gac aac 1200 
Gin Leu Ala Thr Val Tyr Arg Arg Leu Thr Val Tyr Gly Tyr Asp Asn 
385 390 395 400 

cct gcg gcg gcg ctg ttg ctg etc ggt tac ggc ggt atg gcg aat gee 1248 
Pro Ala Ala Ala Leu Leu Leu Leu Gly Tyr Gly Gly Met Ala Asn Ala 
405 410 415 

gtg get ccg teg gec acc gca etc get cag cgc gac teg gtt etc aaa 1296 
Val Ala Pro Ser Ala Thr Ala Leu Ala Gin Arg Asp Ser Val Leu Lys 
420 425 430 

gcg ctg ttc gtc acg aac tgg teg gag ccc'gcc gag gac gag egg cat 1344 
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Ala Leu Phe Val Thr Asn Trp Ser Glu Pro Ala Glu Asp Glu Arg His 
435 440 445 

ctg acc tgg att cgc ggt ttc tac cgc gag atg tac gcc gaa acc ggc 13 92 
Leu Thr Trp He Arg Gly Phe Tyr Arg Glu Met Tyr Ala Glu Thr Gly 
450' 455 460 



gga gtt ccg gtg cca ggt acc cgt gtc gac ggc tec tac ate aac tac 1440 
Gly Val Pro Val Pro Gly Thr Arg Val Asp Gly Ser Tyr He Asn Tyr 
465 470 475 480 



ccg gac acc gac ttg gcc gat cca ttg tgg aac acc tec ggt gtt gcc i4 86 
Pro Asp Thr Asp Leu Ala Asp Pro Leu Trp Asn Thr Ser Gly Val Ala 
485 '490 495 



tgg cac gac ctg tac tac aaa gac aac tac ccg egg ctg cag egg gcc 1536 
Trp His Asp Leu Tyr Tyr Lys Asp Asn Tyr Pro Arg Leu Gin Arg Ala 
500 505 510 

aaa gcg egg tgg gat ccg cag aac ate ttc cag cac ggc ctg teg ate 1584 
Lys Ala Arg Trp Asp Pro Gin Asn He Phe Gin His Gly Leu Ser He 
515 520 525 

aaa ccg ccg gca egg ctt tea ccc ggt cag cca tga 1620 
Lys Pro Pro Ala Arg Leu Ser Pro Gly Gin Pro 
530 535 



<210> 16 
<211> 539 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 16 

Met lie Ser Ala Ala Gly Glu Gin Ser Gly Pro Val Arg Lys Gly Gly 
15 10 15 

Ala Val Pro Glu Phe His Asp Pro Ala Pro Met Asn Arg Arg Thr Pro 
20 .25 30 

Gly Thr Glu He Thr Val Glu Pro Asp Asp Pro Arg Tyr Pro Asp Leu 
35 40 45 



Val Val Gly His Asn Pro Arg Phe Thr Gly Lys Pro Glu Arg He His 

50 55 60 

He Ala Ser Ser Ala Glu Asp Val Val His Ala Val Ala Asp Ala Val 

65 . 70 75 80 
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Arg Thr Gly Arg Arg Val Gly Val Arg Ser Gly Gly His Cys Phe Glu 
85 90 95 

Asn Leu Val Ala Asp Pro Ala lie Arg Val Leu Val Asp Leu Ser* Glu 
100 * 105 110 

Leu Asn Arg Val Tyr Tyr Asp Ser Thr Arg Gly Ala Phe Ala lie Glu 
115 120 125 

Ala Gly Ala Ala Leu Gly Gin Val Tyr Arg Thr Leu Phe Lys Asn Trp 
130 135 140 

Gly Val Thr He Pro Thr Gly Ala Cys Pro Gly Val Gly Ala Gly Gly 
145 150 155 160 

His He Leu Gly Gly Gly Tyr Gly Pro Leu Ser Arg Arg Phe Gly Ser 
165 170 175 

Val Val Asp Tyr Leu Gin Gly Val Glu Val Val Val Val Asp Gin Ala 
180 185 190 

Gly Glu Val His He Val Glu Ala Asp Arg Asn Ser Thr Gly Ala Gly 
195 200 205 

His Asp Leu Trp Trp Ala His Thr Gly Gly Gly Gly Gly Asn Phe Gly 
210 215 220 

He Val Thr Arg Phe Trp Leu Arg Thr Pro Asp Val Val Ser Thr Asp 
225 230 235 240 

Ala Ala Glu Leu Leu Pro Arg Pro Pro Ala Thr Val Leu Leu Arg Ser 
245 250 255 

Phe His Trp Pro Trp His Glu Leu Thr Glu Gin Ser Phe Ala Val Leu 
260 265 270 

Leu Gin Asn Phe Gly Asn Trp Tyr Glu Gin His Ser Ala Pro Glu Ser 
275 280 285 

Thr Gin Leu Gly Leu Phe Ser Thr Leu Val Cys Ala His Arg Gin Ala 
290 295 300 

Gly Tyr Val Thr Leu Asn Val His Leu Asp Gly Thr Asp Pro Asn Ala 
305 310 315 320 

Glu Arg Thr Leu Ala Glu His Leu Ser Ala He Asn Ala Gin Val Gly 
325 330 335 
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Val Thr Pro Ala Glu Gly Leu Arg Glu Thr Leu Pro Trp Leu Arg Ser 
340 345 350 

Thr Gin Val Ala Gly Ala He Ala Glu Gly Gly Glu Pro Gly Met Gin 
355 360 365 

Arg Thr Lys Val Lys Ala Ala Tyr Leu Arg Thr Gly Leu Ser Glu Ala 
370 375 380 

Gin Leu Ala Thr Val Tyr Arg Arg Leu Thr Val Tyr Gly Tyr Asp Asn 
385 390 395 400 

Pro Ala Ala Ala Leu Leu Leu Leu Gly Tyr Gly Gly Met Ala Asn Ala 
405 410 415 

Val Ala Pro Ser Ala Thr Ala Leu Ala Gin Arg Asp Ser Val Leu Lys 
420 425 430 

Ala Leu Phe Val Thr Asn Trp Ser Glu Pro Ala Glu Asp Glu Arg His 
435 440 445 

Leu Thr Trp He Arg Gly Phe Tyr Arg Glu Met Tyr Ala Glu Thr Gly 
450 455 460 

Gly Val Pro Val Pro Gly Thr Arg Val Asp Gly Ser Tyr He Asn Tyr 
465 470 475 480 

Pro Asp Thr Asp Leu Ala Asp Pro Leu Trp Asn Thr Ser Gly Val Ala 
485 490 495 

Trp His Asp Leu Tyr Tyr Lys Asp Asn Tyr Pro Arg Leu Gin Arg Ala 
500 505 510 

Lys Ala Arg Trp Asp Pro Gin Asn He Phe Gin His Gly Leu Ser He 
515 520 525 



Lys Pro Pro Ala Arg Leu Ser Pro Gly Gin Pro 
530 535 



<210> 17 
<211> 1194 
<212> DNA 

<213> Saccharopolyspora spinosa 
<220> 
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<221> CDS 

<222> (1)..(1191) 

<223> ORF6; Methyl transferase 

<400> 17 

atg tec aca acg cac gag ate gaa acc gtg gaa cgc ate ate etc gee 48 

Met Ser Thr Thr His Glu He Glu Thr Val Glu Arg He He Leu Ala 
1 5 10 15 

gee gga tec agt gcg gcg age ctg gee gac ctg acc acc gaa etc gga 96 
Ala Gly Ser Ser Ala Ala Ser Leu Ala Asp Leu Thr Thr Glu Leu Gly 
20 25 30" 

etc gee agg ate gca ccc gtg ctg ate gac gag ate etc ttc cgc gcg 144 
Leu Ala Arg He Ala Pro Val Leu He Asp Glu He Leu Phe Arg Ala 
35 40 45 

gaa ccg gee ccc gac ate gaa egg acc gag gtc gcg gtc cag ate acc 192 
Glu Pro Ala Pro Asp He Glu Arg Thr Glu Val Ala Val Gin He Thr 
50 55 60 

cac cga ggc gag acc gtt gac ttc gtc ctg acg eta cag tec ggt gag 240 
His Arg Gly Glu Thr Val Asp Phe Val Leu Thr Leu Gin Ser Gly Glu 
65 70 75 80 

ctg ate aag gee gag caa cga ccg gtc gga gac gtc ccg ctg egg ate 2 88 
Leu He Lys Ala Glu Gin Arg Pro Val Gly Asp Val Pro Leu Arg He 
85 90 95 

ggt tac gag etc acc gat etc ate gee gag ttg ttc ggc cca gga get 336 
Gly Tyr Glu Leu Thr Asp Leu He Ala Glu Leu Phe Gly Pro Gly Ala 
100 105 110 

ccc agg gee gtc ggc gee egg age acc aac ttc etc cga acc acc aca 3 84 
Pro Arg Ala Val Gly Ala Arg Ser Thr Asn Phe Leu Arg Thr Thr Thr 
115 120 . 125 

tec ggt teg ata ccc ggt ccg teg gaa ctg tec gat ggc ttc cag gee 432 
Ser Gly Ser He Pro Gly Pro Ser Glu Leu Ser Asp Gly Phe Gin Ala 
130 135 140 

ate tec gca gtg gtc gec ggc tgc ggg cac cga cgt ccc gac etc aac 4 80 
He Ser Ala Val Val Ala Gly Cys Gly His Arg Arg Pro Asp Leu Asn 
145 150 155 160 

ttg etc gee tec cac tac cgc acg gac aag tgg ggc ggc ctg cac tgg 528 
Leu Leu Ala Ser His Tyr Arg Thr Asp Lys Trp Gly Gly Leu His Trp 
165 170 175 
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ttc acc ccg eta tac gag cga cac etc ggc gag ttc cgt gat cgc ccg 576 
Phe Thr Pro Leu Tyr Glu Arg His Leu Gly Glu Phe Arg Asp Arg Pro 
180 185 190 

gtg cgc ate ctg gag ate ggt gtc ggt ggc tac aac ttc gac ggt ggc 624 
Val Arg lie Leu Glu He Gly Val Gly Gly Tyr Asn Phe Asp Gly Gly 
195 200 205 

ggc. ggc gaa tec ctg aag atg tgg aag cgc tac ttc cac cgc ggc etc 672 
Gly" Gly Glu Ser Leu Lys Met Trp Lys Arg Tyr Phe His Arg Gly Leu 
210 215 220 

gtg ttc ggg atg gac gtt ttc gac aag tec ttc etc gac cag cag agg 720 
Val Phe Gly Met Asp Val Phe Asp Lys Ser Phe Leu Asp Gin Gin Arg 
225 230 235 240 

etc tgc acc gtc cgc gee gac cag age aag ccc gag gag ctg gee gec 768 
Leu Cys Thr Val Arg Ala Asp Gin Ser Lys Pro Glu Glu Leu Ala Ala 
245 250 255 

gtt gac gac aag tac gga ccg ttc gac ate ate ate gac gat ggc age 816 
Val Asp Asp Lys Tyr Gly Pro Phe Asp He He He Asp Asp Gly Ser 
260 265 270 

cac ate aac gga cac gtg cgc aca tec ctg gaa acg ctg ttc ccc egg 864 
His He Asn Gly His Val Arg Thr Ser Leu Glu Thr Leu Phe Pro Arg 
275 280 285 

ttg cgc age ggt ggc gta tac gtg ate gag gat ctg tgg acg acc tat 912 
Leu Arg Ser Gly Gly Val Tyr Val He Glu Asp Leu Trp Thr Thr Tyr 
290 295 300 

get ccc gga ttc ggc ggg cag gcg cag tgc ccg gee gca ccc ggc acc 960 
Ala Pro Gly Phe Gly Gly Gin Ala Gin Cys Pro Ala Ala Pro Gly Thr 
305 . 310 315 320 

acg gtc age ctg etc aag aac ctg ttg gaa ggc gtt cag cac gag gag 1008 
Thr Val Ser Leu Leu Lys Asn Leu Leu Glu Gly Val Gin His Glu Glu 
325 330 335 

cag ccg cat gcg ggc teg tac gag ccg age tac ctg gaa cgc aat ttg 1056 
Gin Pro His Ala Gly Ser Tyr Glu Pro Ser Tyr Leu Glu Arg Asn Leu 
340 345 350 

gtc ggc etc cac acc tac cac aac ate gcg ttc ctg gag aaa ggc gtc 1104 
Val Gly Leu His Thr Tyr His Asn He Ala Phe Leu Glu Lys Gly Val 
355 360 365 
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aac gcc gaa ggc ggc gtt cct get tgg gtg cca agg agt ctg gac gac 1152 
Asn Ala Glu Gly Gly Val Pro Ala Trp Val Pro Arg Ser Leu Asp Asp 
370 375 380 

ata ttg cac ctg gcc gac gtg aac age gcg gag gac gag tga 1194 
He Leu His Leu Ala Asp Val Asn Ser Ala Glu Asp Glu 
385 390 395 



<210> 18 
<211> 397 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 18 

Met Ser Thr Thr His Glu He Glu Thr Val Glu Arg He He Leu Ala 
15 10 15 

Ala Gly Ser Ser Ala Ala Ser Leu Ala Asp Leu Thr Thr Glu Leu Gly 
20 25 30 

Leu Ala Arg He Ala Pro Val Leu He Asp Glu He Leu Phe Arg Ala 
35 40 45 

Glu Pro Ala Pro Asp He Glu Arg Thr Glu Val Ala Val Gin He Thr 
50 55 60 

His Arg Gly Glu Thr Val Asp Phe Val Leu Thr Leu Gin Ser Gly Glu 
65 70 75 80 

Leu He Lys Ala Glu Gin Arg Pro Val Gly Asp Val Pro Leu Arg He 
85 90 95 

Gly Tyr Glu Leu Thr Asp Leu He Ala Glu Leu Phe Gly Pro Gly Ala 
100 105 110 

Pro Arg Ala Val Gly Ala Arg Ser Thr Asn Phe Leu Arg Thr Thr Thr 
115 120 125 

Ser Gly Ser He Pro Gly Pro Ser Glu Leu Ser Asp Gly Phe Gin Ala 
130 135 140 

He Ser Ala Val Val Ala Gly Cys Gly His Arg Arg Pro Asp Leu Asn 
145 150 155 160 

Leu Leu Ala Ser His Tyr Arg Thr Asp Lys Trp Gly Gly Leu His Trp 
165 170 175 
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Phe Thr Pro Leu Tyr Glu Arg His Leu Gly Glu Phe Arg Asp Arg Pro 
180 185 190 

Val Arg lie Leu Glu lie Gly Val Gly Gly Tyr Asn Phe Asp Gly Gly 
195 200 205. 

Gly Gly Glu Ser Leu Lys Met Trp Lys Arg Tyr Phe His Arg Gly Leu 
210 215 220 

Val Phe Gly Met Asp Val Phe Asp Lys Ser Phe Leu Asp Gin Gin Arg 
225 230 235 240 

Leu Cys Thr Val Arg Ala Asp Gin Ser Lys Pro Glu Glu Leu Ala Ala 
245 250 255 

Val Asp Asp Lys Tyr Gly Pro Phe Asp lie lie He Asp Asp Gly Ser 
260 265 270 

His He Asn Gly His Val Arg Thr Ser Leu Glu Thr Leu Phe Pro Arg 
275 280 285 

Leu Arg Ser Gly Gly Val Tyr Val He Glu Asp Leu Trp Thr Thr Tyr 
290 . 295 300 

Ala Pro Gly Phe Gly Gly Gin Ala Gin Cys Pro Ala Ala Pro Gly Thr 
305 310 315 320 

Thr Val Ser Leu Leu Lys Asn Leu Leu Glu Gly Val Gin His Glu Glu 
325 330 335 

Gin Pro His Ala Gly Ser Tyr Glu Pro Ser Tyr Leu Glu Arg Asn Leu 
340 345 350 

Val Gly Leu His Thr Tyr His Asn He Ala Phe Leu Glu Lys Gly Val 
355 360 365 

Asn Ala Glu Gly Gly Val Pro Ala Trp Val Pro Arg Ser Leu Asp Asp 
370 375 380 

lie Leu His Leu Ala Asp Val Asn Ser Ala Glu Asp Glu 
385 390 395 



<210> 19 
<211> 900 
<212> DNA 
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<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) • ■ (897) 

<223> ORF7; O-Methyltransf erase 

<400> 19 

gtg aac age aga ggg gcg aac aea cag gca ttt ccg acc gcg gat cag 4 8 

Val Asn Ser Arg Gly Ala Asn Thr Gin Ala Phe Pro Thr Ala Asp Gin 
15 10 .15 

gtg gag tec ate ttc gat gcg ttg gcg cac ggg cgt ccc ctg cac cac 96 
Val Glu Ser lie Phe Asp Ala Leu Ala His Gly Arg Pro Leu His His 
20 . 25 30 

ggt tac tgg gcg ggc ggg tat egg gag gat gee ggt gee aca ccg tgg 144 
Gly Tyr Trp Ala Gly Gly Tyr Arg Glu Asp Ala Gly Ala Thr Pro Trp 
35 40 45 

teg gat get gee gac caa ctg acc gac ctg ttc ate gac aag gec gcg 192 
Ser Asp Ala Ala Asp Gin Leu Thr Asp Leu Phe lie Asp Lys Ala Ala 
50 55 60 

etc cgt ccc gga gcg cac ctg ttc gac ctg ggc tgc ggc aat'ggg cag 240 
Leu Arg Pro Gly Ala His Leu Phe Asp Leu Gly Cys Gly Asn Gly Gin 
65 70 75 . 80 

ccc gta gtc cgt gcg gca tgc gee age ggc gtt cga gtc acc gga ate 288 
Pro Val Val Arg Ala Ala Cys Ala Ser Gly Val Arg Val Thr Gly He 
85 90 95 

acc gtg aac gee cag cat etc gee gec gee acc agg etc gee aac gag 336 
Thr Val Asn Ala Gin His Leu Ala Ala Ala Thr Arg Leu Ala Asn Glu 
100 105 110 

acc gga ctg gee ggc agt ctt gag ttc gat eta gtc gac ggc gec cag 3 84 
Thr Gly Leu Ala Gly Ser Leu Glu Phe Asp Leu Val Asp Gly Ala Gin 
115 120 125 

ctg ccc tac ccg gac ggt ttc ttt cag gee gca tgg gcg atg cag tec 432 
Leu Pro Tyr Pro Asp Gly Phe Phe Gin Ala Ala Trp Ala Met Gin Ser 
130 135 140 

gtc gtg cag ate gtg gac cag gee gee gcg ate cgc gag gtc cac cga 480- 
Val Val Gin He Val Asp Gin. Ala Ala Ala He Arg Glu Val His Arg 
145 150 155 160 
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ate ctg gaa ccc ggc ggc egg ttc gtc etc gga gac ate ate act egg 528 

lie Leu Glu Pro Gly Gly Arg Phe Val Leu Gly Asp lie" lie Thr Arg 
165 170 175 

gtt cga etc ccg gaa gag tac gcg gcg gtt tgg acg ggc acg ace gee 576 

Val Arg Leu Pro Glu Glu Tyr Ala Ala Val Trp Thr Gly Thr Thr Ala 
180 185 190 

cat acc ttg aac age ttc acg gcg ctg gtc age gaa gee ggg ttc gag 624 ■ 

His Thr Leu Asn Ser Phe Thr Ala Leu Val Ser Glu Ala Gly Phe Glu 
195 200 205 

att etc gaa gtc acc gac etc acg gca cag acc agg tgc atg gtc tec 672 • 

lie Leu Glu Val Thr Asp Leu Thr Ala Gin Thr Arg Cys Met Val Ser 
210 215 220 



tgg tac gtc gac gag ttg etc egg aaa 
Trp Tyr Val Asp Glu Leu Leu Arg Lys 
225 230 

gag cct gcg get gtc ggc acc tac cag 
Glu Pro Ala Ala Val Gly Thr Tyr Gin 
245 

gcg gcg aag cac gga ccg gga cca gca 
Ala Ala Lys His Gly Pro Gly Pro Ala 
260 265 

gaa tac egg aaa cat ccg gat tac gee 
Glu Tyr Arg Lys His Pro Asp Tyr Ala 
275 280 

ttc atg etc ctg cag get cga aag aag 
Phe Met Leu Leu Gin Ala Arg Lys Lys 
290 '295 



etc gat gag etc gee ggc gtc 720 
Leu Asp Glu Leu Ala Gly Val 
235 240 

caa cgc tac ttg gga gac ate 768 
Gin Arg Tyr Leu Gly Asp He 
250 255 

cag ctg ate gee gcg gtt gcg 816 
Gin Leu He Ala Ala Val Ala 
270 

aga aac gag gaa age atg ggt 864 
Arg Asn Glu Glu Ser Met Gly 
285 

cag tec tga 900 
Gin Ser 



<210> 20 
<211> 299 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 20 

Val Asn Ser Arg Gly Ala Asn Thr Gin Ala Phe Pro Thr Ala Asp Gin 
1.5 10 15 

Val Glu Ser He Phe Asp Ala Leu Ala His Gly Arg Pro Leu His His 
20 25 30 
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Gly Tyr Trp Ala Gly Gly Tyr Arg Glu Asp Ala Gly Ala Thr Pro Trp 
35 40 45 

Ser Asp Ala Ala Asp Gin Leu Thr Asp Leu Phe lie Asp Lys Ala Ala 
50 55 60 

Leu Arg Pro Gly Ala His Leu Phe Asp Leu Gly Cys Gly Asn Gly Gin 
65 70 75 80 

Pro Val Val Arg Ala Ala Cys Ala Ser Gly Val Arg Val Thr Gly lie 
85 90 95 

Thr Val Asn Ala Gin His Leu Ala Ala Ala Thr Arg Leu Ala Asn Glu 
100 105 110 

Thr Gly Leu Ala Gly Ser Leu Glu Phe Asp Leu Val Asp Gly Ala Gin 
115 120 125 

Leu Pro Tyr Pro Asp Gly Phe Phe Gin Ala Ala Trp Ala Met Gin Ser 
130 135 140 

Val Val Gin lie Val Asp Gin Ala Ala Ala lie Arg Glu Val . His Arg 
145 150 155 160 

lie Leu Glu Pro Gly Gly Arg Phe Val Leu Gly Asp lie lie Thr Arg 
165 170 175 

Val Arg Leu Pro Glu Glu Tyr Ala Ala Val Trp Thr Gly Thr' Thr Ala 
180 185 190 

His Thr Leu Asn Ser Phe Thr Ala Leu Val Ser Glu Ala Gly Phe Glu 
195 200 205 

lie Leu Glu Val Thr Asp Leu Thr Ala Gin Thr Arg Cys Met Val Ser 
210 215 220 

Trp Tyr Val Asp Glu Leu Leu Arg Lys Leu Asp Glu Leu Ala Gly Val 
225 230 235 240 

Glu Pro Ala Ala Val Gly Thr Tyr Gin Gin Arg Tyr Leu Gly Asp lie 
245 250 255 

Ala Ala Lys His Gly Pro Gly Pro Ala Gin Leu He Ala Ala Val Ala 
260 265 270 



Glu Tyr Arg Lys His Pro Asp Tyr Ala Arg Asn Glu Glu Ser Met Gly 
275 280 285 
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Phe Met Leu Leu Gin Ala Arg Lys Lys Gin Ser 
290 295 



<210> 21 
<211> 1167 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1164) 

<223> ORF8; Cyclisierungsenzym 



<400> 21 

atg gcc tec gag cac gec age ctg gtc ggc gac gat ctg egg gca ccc 48 
Met Ala Ser Glu His Ala Ser Leu Val Gly Asp Asp Leu Arg Ala Pro 
1 5 10 15 

gcg gat gat ccc ttc tac cga ccg ccg acg ccg eta ccg ccg ggt gtc 96 
Ala Asp Asp Pro Phe Tyr Arg Pro Pro Thr Pro Leu Pro Pro Gly Val 
20 25 30 

ccg ggc acg etc etc agg gcc egg ccc gtc teg gca ctg cgc ggc acg 144 
Pro Gly Thr Leu Leu Arg Ala Arg Pro Val Ser Ala Leu Arg Gly Thr 
35 40 45 



ggc gaa ccc gtc gca gcc aag gcc tgg caa ate etc tac egg tec aac 192 
Gly Glu Pro Val Ala Ala Lys Ala Trp Gin lie Leu Tyr Arg Ser Asn 
50 55 60 

tec gcc ctt ggc atg ccg aac gcc gtc tec ggc ace gtt ctg gtg ccg 24 0 
Ser Ala Leu Gly Met Pro Asn Ala Val Ser Gly Thr Val Leu Val Pro 
65 70 75 80 



aac ate ccg tgg ccg cgc gaa gat cgc ccc ate ate act ttc gca gtg 288 
Asn lie Pro Trp Pro Arg Glu Asp Arg Pro He He Thr Phe Ala Val 
85 90 95 



ggc acc cac ggc etc ggt age caa gtt gcc ccg teg tac ctg ctt cga 336 
Gly Thr His Gly Leu Gly Ser Gin Val Ala Pro Ser Tyr Leu Leu Arg 
100 105 HO 



acc gga acc gag ccg gag acc gag ctg ate gcc gtg gcc etc gac cgc 3 84 
Thr Gly Thr Glu Pro Glu Thr Glu Leu He Ala Val Ala Leu Asp Arg 
115 120 125 
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999 fc 99 9cc gtg gtc ate acc gac tac gaa ggc etc ggt act cct gga 432 
Gly Trp Ala Val Val He Thr Asp Tyr Glu Gly Leu Gly Thr Pro Gly 
130 135 140 

acc cac acc tac acc gtc ggc agg gcg cag gga cac gee atg etc gat 4 80 
Thr His Thr Tyr Thr Val Gly Arg Ala Gin Gly His Ala Met Leu Asp 
145 150 155 160 

gec gec cgc get gcg caa egg eta ccg ggc tec ggc ctg acg acc gac 528 
Ala Ala Arg Ala Ala Gin Arg Leu Pro Gly Ser Gly Leu Thr Thr Asp 
165 170 175 

tgc ccg gtc ggc ate tgg ggc tat gcg cag ggt ggg caa gcg teg gec 576 
Cys Pro Val Gly He Trp Gly Tyr Ala Gin Gly Gly Gin Ala Ser Ala 
180 185 190 

ttc gee ggc gaa ctg cac ccc acc tac gca cct gaa ctg cga ate cgc 624 
Phe Ala Gly Glu Leu His Pro Thr Tyr Ala Pro Glu Leu Arg He Arg 
195 200 205 

get gcg gee gca ggt gcg gtg ccg ate gat ctg ctg gac ate ate cac 672 
Ala Ala Ala Ala Gly Ala Val Pro He Asp Leu Leu Asp He He His 
210 215 22.0 

cga aat gac ggg gtg ttc acc ggg ccg gtg ctg gee ggc ctg gtc ggg 720 
Arg Asn Asp Gly Val Phe Thr Gly Pro Val Leu Ala Gly Leu Val Gly 
225 230 235 240 

cat gec get gee tac ccc gat ctg cca ttc gac gag ctt etc acc gaa 768 
His Ala Ala Ala Tyr Pro Asp Leu Pro Phe Asp Glu Leu Leu Thr Glu 
245 250 255 

gcg ggt cgt acc gee gtt gat caa gtg cgc gag etc ggt gca ccg gag 816 
Ala Gly Arg Thr Ala Val Asp Gin Val Arg Glu Leu Gly Ala Pro Glu 
260 265 270 

etc gtc acc cgc ttc etc ggc cgc gag ctg age gac ttc etc gac act 864 
Leu Val Thr Arg Phe Leu Gly Arg Glu Leu Ser Asp Phe Leu Asp Thr 
275 280 285 

tec ggc ctt ttc gag caa cct cga tgg cgc gca cga ctg gee gaa age 912 
Ser Gly Leu Phe Glu Gin Pro Arg Trp Arg Ala Arg Leu Ala Glu Ser 
290 295 300 

gtc gca ggt agg aac ggt ggc ccg gtg gtc ccc acg etc gtc tac cac 960 
Val Ala Gly Arg Asn Gly Gly Pro Val Val Pro Thr Leu Val Tyr His 
305 310 315 320 
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age acg gac gac gag ate gtt ccg ttc gca ttc ggc gag cga etc egg 1008 
Ser .Thr Asp Asp Glu He Val Pro Phe Ala Phe Gly Glu Arg Leu Arg 
325 330 335 

gac age tac cgc gcg gcg ggt acg cca gtg egg tgg cat ccg etc tec 1056 
Asp Ser Tyr Arg Ala Ala Gly Thr Pro Val Arg Trp His Pro Leu Ser 
340 345 350 

gga ttg get cac ttt ccc gccgcc ctg gee age teg cga gtg gtc gtc 1104 
Gly Leu Ala His Phe Pro Ala Ala Leu Ala Ser Ser Arg Val Val Val 
355 360 365 

teg tgg ttc gac gag cac ttc tec gag ccg tec gcg ate age ggt ccg 1152 
Ser Trp Phe Asp Glu His Phe Ser Glu Pro Ser Ala He Ser Gly Pro 
370 375 380 

cga gat gee agg tga 1167 

Arg Asp Ala Arg 

385 



<210> 22 
<211> 388 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 22 

Met Ala Ser Glu His Ala Ser Leu Val Gly Asp Asp Leu Arg Ala Pro 
15 10 15 

Ala Asp Asp Pro Phe Tyr Arg Pro Pro Thr Pro Leu Pro Pro Gly Val 
20 25 30 

Pro Gly Thr Leu Leu Arg Ala Arg Pro Val Ser Ala Leu Arg Gly Thr 
35 40 45 

Gly Glu Pro Val Ala Ala Lys Ala Trp Gin He Leu Tyr Arg Ser Asn 
50 55 60 

Ser Ala Leu Gly Met Pro Asn Ala Val Ser Gly Thr Val Leu Val Pro 
65 70 75 80 

Asn He Pro Trp Pro Arg Glu Asp Arg Pro lie He Thr Phe Ala Val 
85 90 95 

Gly Thr His Gly Leu Gly Ser Gin Val Ala Pro Ser Tyr Leu Leu Arg 
100 105 110 
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Thr Gly Thr Glu Pro Glu Thr Glu Leu lie Ala Val Ala Leu Asp Arg 
115 120 125 

Gly Trp Ala Val Val lie Thr Asp Tyr Glu Gly Leu Gly Thr Pro Gly 
130 135 140 

Thr His Thr Tyr Thr Val Gly Arg Ala Gin Gly His Ala Met Leu Asp 
145 150 155 160 

Ala Ala Arg Ala Ala Gin Arg Leu Pro Gly Ser Gly Leu Thr Thr Asp 
165 170 175 

Cys Pro Val Gly He Trp Gly Tyr Ala Gin Gly Gly Gin Ala Ser Ala 
180 185 190 

Phe Ala Gly Glu Leu His Pro Thr Tyr Ala Pro Glu Leu Arg He Arg 
195 200 205 

Ala Ala Ala Ala Gly Ala Val Pro He Asp Leu Leu Asp He He His 
210 215 220 

- Arg Asn Asp Gly Val Phe Thr Gly Pro Val Leu Ala Gly Leu Val Gly 
225 230 235 240 

His Ala Ala Ala Tyr Pro Asp Leu Pro Phe Asp Glu Leu Leu Thr Glu 
245 250 255 

Ala Gly Arg Thr Ala Val Asp Gin Val Arg Glu Leu Gly Ala Pro Glu 
260 265 270 

Leu Val Thr Arg Phe Leu Gly Arg Glu Leu Ser Asp Phe Leu Asp Thr 
275 280 285 

Ser Gly Leu Phe Glu Gin Pro Arg Trp Arg Ala Arg Leu Ala Glu Ser 
290 295 300 

Val Ala Gly Arg Asn Gly Gly Pro Val Val Pro Thr Leu Val Tyr His 
305 310 315 320 

Ser Thr Asp Asp Glu He Val Pro Phe Ala Phe Gly Glu Arg Leu Arg 
325 330 335 

Asp Ser Tyr Arg Ala Ala Gly Thr Pro Val Arg Trp His Pro Leu Ser 
340 345 350 

Gly Leu Ala His Phe Pro Ala Ala Leu Ala Ser Ser Arg Val Val Val 
355 360 365 
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Ser Trp Phe Asp Glu His Phe Ser Glu Pro Ser Ala He Ser Gly Pro 
370 375 380 

Arg Asp Ala Arg - * 
385 



<210> 23 

<211> 1011 

<212> DNA 

<213> Saccharopolyspora spinosa 



<220> 

<221> CDS 

<222> (1)..{1008) 

<223> 0RF9; 2 , 3 -Reduktase 

<400> 23 

atg acc age teg atg cga aag ccg gtg cgc ate ggt gtg etc ggg tgc 4 8 
Met Thr Ser Ser Met Arg Lys Pro Val Arg He Gly Val Leu Gly Cys 
15 10 15 

get tec ttc gcg tgg cga egg atg ctg ccc gcg atg tgc gac gtg gee 96 
Ala Ser Phe Ala Trp Arg Arg Met Leu Pro Ala Met Cys Asp Val Ala 
20 25 30 

gaa aca gag gtg gtg gcg gtg gcg age cgt gat ccg gcg aaa gec gaa 144 
Glu Thr Glu Val Val Ala Val Ala Ser Arg Asp Pro Ala Lys Ala Glu 
35 40 45 

egg ttc gca gcg cga ttc gaa tgc gag gcg gtg ctg ggt tac cag egg 192 
Arg Phe Ala Ala Arg Phe Glu Cys Glu Ala Val Leu Gly Tyr Gin Arg 
50 55 60 

etc ctg gag egg ccg gac ate gat gee gtc tac gtg ccg ttg ccg cct 240 
Leu Leu Glu Arg Pro Asp He Asp Ala Val Tyr Val Pro Leu Pro Pro 
65 70 75 80 

ggc atg cat gca gag tgg ate ggc aag gcg ctt gag gca gac aaa cac 288 
Gly Met His Ala Glu Trp He Gly Lys Ala Leu Glu Ala Asp Lys His 
85 90 95 



gtg ctt gcg gag aaa ccg ctg acg acg acg gcg tec gac acc get cgc 336 
Val Leu Ala Glu Lys Pro Leu Thr Thr Thr Ala Ser Asp Thr Ala Arg 
100 .105 110 
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ctg gtc ggg ctg gcc agg agg aag aac ctg ctg ctg egg gag aat tac 384 
Leu Val Gly Leu Ala Arg Arg Lys Asn Leu Leu Leu Arg Glu Asn Tyr 
115 120 125 

ctg ttc etc cac cac ggc egg cac gac gtg gtc cgc gac ctg ctg caa 432 
Leu Phe Leu His His Gly Arg His Asp Val Val Arg Asp Leu Leu Gin 
130 135 140 

tec ggg gag ate ggt gag etc egg gag ttc ace gcc gtg ttc gga att 480 
Ser Gly Glu lie Gly Glu Leu Arg Glu Phe Thr Ala Val Phe Gly He 
145 . 150 155 160 

ccg ccg % ctt ccc gac acg gac ate cgc tat cgc acc gaa etc ggt ggc 528 
Pro Pro Leu Pro Asp Thr Asp He Arg Tyr Arg Thr Glu Leu Gly Gly 
165 170 175 

gga gcg ttg ctg gac ate ggt gtc tat ccc gcc cgt gcc get egg cac 576 
Gly Ala Leu Leu Asp He Gly Val Tyr Pro Ala Arg Ala Ala Arg His 
180 185 190 

ttt etc etc ggt ccg etc acg gtt etc ggc gca age teg cac gag gcc 624 
Phe Leu Leu Gly Pro Leu Thr Val Leu Gly Ala Ser Ser His Glu Ala 
195 200 205 

cag gag teg ggc gtc gac ttg teg ggc age gtg ctg etc caa teg gaa 672 
Gin Glu Ser Gly Val Asp Leu Ser Gly Ser Val Leu Leu Gin Ser Glu 
210 215 220 

ggt ggc acc gtt gcc cac etc gga tac ggt ttc gtg cac cac tac cgc 720 
Gly Gly Thr Val Ala His Leu Gly Tyr Gly Phe Val His His Tyr Arg . 
225 230 235 240 

age gcg tac gag ctg tgg ggg agt cgt ggg cga ate gtc gtc gac egg 768 
Ser Ala Tyr Glu Leu Trp Gly Ser Arg Gly Arg He Val Val Asp Arg 
245 250 255 

gcg ttc acg ccg ccc gcc gag tgg cag gcc gtg ate cga ate gag egg 816 
Ala Phe Thr Pro Pro Ala Glu Trp Gin Ala Val lie Arg lie Glu Arg 
260. 265 270 

aag ggc gtt gtc gac gag ttg tec ttg cca gcg gaa gat cag gtt cgc 864 
Lys Gly Val Val Asp Glu Leu Ser Leu Pro Ala Glu Asp Gin Val Arg 
275 280 285 

aag gcg gtc acc gcc ttc gca cgc gac ate aga gca ggg aca ggc gtg 912 
Lys Ala Val Thr Ala Phe Ala Arg Asp He Arg Ala Gly Thr Gly Val 
290 295 300 
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gac gac cct gcg gtg gcc gga gat teg 
Asp Asp Pro Ala Val Ala Gly Asp Ser 
305 310 

gcc gcg ctg gtg gag gcg ate ggt cag 
Ala Ala Leu Val Glu Ala He Gly Gin 
325 

tag 



ggc gaa teg atg ate cag cag 960 
Gly Glu Ser Met He Gin Gin 
315 320 

gcc cgt egg tgc ggg tec aca 1008 
Ala Arg Arg Cys Gly Ser Thr 
330 335 

1011 



<210> 24 
<211> 336 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 24 

Met Thr Ser Ser Met Arg Lys Pro Val Arg He Gly Val Leu Gly Cys 
15 10 15 

Ala Ser Phe Ala Trp Arg Arg Met Leu Pro Ala Met Cys Asp Val Ala 
20 25 30 

Glu Thr Glu Val Val Ala Val Ala Ser Arg Asp Pro Ala Lys Ala Glu 
35 40 45 

Arg Phe Ala Ala Arg Phe Glu Cys Glu Ala Val Leu Gly Tyr Gin Arg 
50 55 60 

Leu Leu Glu Arg Pro Asp He Asp Ala Val Tyr Val Pro Leu Pro Pro 
65 70 75 80 

Gly Met His Ala Glu Trp He Gly Lys Ala Leu Glu Ala Asp Lys His 
85 90 95 

Val Leu Ala Glu Lys Pro Leu Thr Thr Thr Ala Ser Asp Thr Ala Arg 
100 105 110 

Leu Val Gly Leu Ala Arg Arg Lys Asn Leu Leu Leu Arg Glu As«n Tyr 
115 120 125 

Leu Phe Leu His His Gly Arg His Asp Val Val Arg Asp Leu Leu Gin 
130 135 140 

Ser Gly Glu He Gly Glu Leu Arg Glu Phe Thr Ala Val Phe Gly He 
145 150 155 160 

Pro Pro Leu Pro Asp Thr Asp He Arg Tyr Arg Thr Glu Leu Gly Gly 
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165 170 175 

Gly Ala Leu Leu Asp lie Gly Val Tyr Pro Ala Arg Ala Ala Arg His 
180 185 190 

Phe Leu Leu Gly Pro Leu Thr Val Leu Gly Ala Ser Ser His Glu Ala 
195 200 205 

Gin Glu Ser Gly Val Asp Leu Ser Gly Ser Val Leu Leu Gin Ser Glu 
210 215 220 

Gly Gly Thr Val Ala His Leu Gly Tyr Gly Phe Val His His Tyr Arg 
225 230 235 240 

Ser Ala Tyr Glu Leu Trp Gly Ser Arg Gly Arg lie Val Val Asp Arg 
245 250 255 

Ala Phe Thr Pro Pro Ala Glu Trp Gin Ala Val He Arg He Glu Arg 
260 265 270 

Lys Gly Val Val Asp Glu Leu Ser Leu Pro Ala Glu Asp Gin Val Arg 
275 280 285 

Lys Ala Val Thr Ala Phe Ala Arg Asp He Arg Ala Gly Thr Gly Val 
290 295 300 

Asp Asp Pro Ala Val Ala Gly Asp Ser Gly Glu Ser Met He Gin Gin 
305 310 315 320 

Ala Ala Leu Val Glu Ala He Gly Gin Ala Arg Arg Cys Gly Ser Thr 
325 330 335 



<210> 25 
<211> 1461 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1458) 

<223> ORF10; 2 , 3 -Dehydratase 

<400> 25 

atg age agt tct gtc gaa get gag gca agt get get gcg ccg etc ggc 48 

Met Ser Ser Ser Val Glu Ala Glu Ala Ser Ala Ala Ala Pro Leu Gly 

15 10 15 
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age aac aac acg egg egg ttc gtc gac tct gcg ctg age get tgc aat 96 
Ser Asn Asn Thr Arg Arg Phe Val Asp Ser Ala Leu Ser Ala Cys Asn 
20 25 30 

ggc atg att ccg acc acg gag ttc cac tgc tgg etc gee gat egg ctg 144 
Gly Met lie Pro Thr Thr Glu Phe His Cys Trp Leu Ala Asp Arg Leu 
35 40 45 

ggc gag aac age ttc gag acc aat cgc ate ccg ttc gac cgc ctg teg 192 
Gly Glu Asn Ser Phe Glu Thr Asn Arg lie Pro Phe Asp Arg Leu Ser 
50 55 60 

aaa tgg aaa ttc gat gee age acg gag aac ctg gtt cat gee gac ggt 24 0 
Lys Trp Lys Phe Asp Ala Ser Thr Glu Asn Leu Val His Ala Asp Gly 
65 70 75 80 

agg ttc ttc acg gta gaa ggc ctg cag gtc gag acc aac tat- ggc gcg 288 
Arg Phe Phe Thr Val Glu Gly Leu Gin Val Glu Thr Asn Tyr Gly Ala 
85 90 95 

gca ccc age tgg cac cag ccg ate ate aac cag get gaa gta ggt ate 336 
Ala Pro Ser Trp His Gin Pro He He Asn Gin Ala Glu Val Gly He 
100 105 110 

etc ggc att etc gtc aag gag ate gac ggc gtg ctg cac tgc etc atg 3 84 
Leu Gly He. Leu Val Lys Glu He Asp Gly Val Leu His Cys Leu Met 
115 120 125 

tea gca aag atg gaa ccg ggc aac gtc aac gtc ctg cag etc teg ccg 432 
•Ser Ala Lys Met Glu Pro Gly Asn Val Asn Val Leu Gin Leu Ser Pro 
130 135 140 

acg gtt cag gca act egg age aac tac acg cag gca cac cgt ggc age 480 
Thr Val Gin Ala Thr Arg Ser Asn Tyr Thr Gin Ala His Arg Gly Ser 
145 150 155 160 

gtt ccg ccc tat gtg gac tac ttc etc ggg egg ggc cgc ggc cgc gtg 52 8 
Val Pro Pro Tyr Val Asp Tyr Phe Leu Gly Arg Gly Arg Gly Arg Val 
165 170 175 

ctg gta gac gtg etc cag tct gaa cag ggg tec tgg ttc tac egg aag 576 
Leu Val Asp Val Leu Gin Ser Glu Gin Gly Ser Trp Phe Tyr Arg Lys 
180 185 190 

cgc aac egg aac atg gtg gtg gaa gtc cag gag gaa gtg cca gtc ctg 624 
Arg Asn Arg Asn Met Val Val Glu Val Gin Glu Glu Val Pro Val Leu 
195 200 205 



124 



WO 01/16303 PCT/EP00/08013 



cca gac ttc tgc tgg ttg acg etc ggc cag gtg ctg get etc ctt cgt 672 
Pro Asp Phe Cys Trp Leu Thr Leu Gly Gin Val Leu Ala Leu Leu Arg 
210 215 220 

cag gac aac ate gtc aac atg gac acc egg acg gtg ctg tct tgc ate 720 
Gin Asp Asn lie Val Asn Met Asp Thr Arg Thr Val Leu Ser Cys lie 
225 230 235 240 

ccg ttc cac gat tec gee acc gga ccc gaa eta gec gee teg gag gag 768 
Pro Phe His Asp Ser Ala Thr Gly Pro Glu Leu Ala Ala Ser Glu Glu 
245 250 255 

ccc ttc cga cag gcg gtg gee agg teg etc teg cac ggc ate gat teg 816 
Pro Phe Arg Gin Ala Val Ala Arg Ser Leu Ser His Gly lie Asp Ser 
260 265 270 

teg agt ate tec gag gcg gtc ggt tgg ttc gag gaa gee aag gee cgc 864 
Ser Ser lie Ser Glu Ala Val Gly Trp Phe Glu Glu Ala Lys Ala Arg 
275 280 285 

tac cgc ttg egg gca acg cgc gtt ccg ctg age agg gtc gac aag tgg 912 
Tyr Arg Leu Arg Ala Thr Arg Val Pro Leu Ser Arg Val Asp Lys Trp 
290 295 300 

tat cgc acc gat acc gag ate gee cac cag gac ggc aag tac ttc gcg 960 
Tyr Arg Thr Asp Thr Glu lie Ala His Gin Asp Gly Lys Tyr Phe Ala 
305 310 315 320 

gtg ate gcg gtg teg gtg tec gcg acc aat cgt gag gtc gee age tgg 1008 
Val lie Ala Val Ser Val Ser Ala Thr Asn Arg Glu Val Ala Ser Trp 
325 330 335 

acg cag ccg atg ate gaa ccg cga gaa caa ggt gag ate gca ctg ttg 1056 
Thr Gin Pro Met lie Glu Pro Arg Glu Gin Gly Glu He Ala Leu Leu 
340 345 350 

gtc aag egg ate ggc gga gtg ctg cac ggt ttg gtc cac get egg gtg 1104 
Val Lys Arg He Gly Gly Val Leu His Gly Leu Val His Ala Arg Val 
355 360 365 

gag get ggg tat aag tgg act gcg gaa ate get ccc acg gtc cag tgc 1152 
Glu Ala Gly Tyr Lys Trp Thr Ala Glu He Ala Pro Thr Val Gin Cys 
370 375 380 

agt gtg gee aac tac caa age acc ccg teg aac gac tgg ccg ccg ttc 1200 
Ser Val Ala Asn Tyr Gin Ser Thr Pro Ser Asn Asp Trp Pro Pro Phe 
385 390 395 400 
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ttg gac gac gtg etc acc gec gat ccc gaa acc gtg egg tac gaa teg 1248 
Leu Asp Asp Val Leu Thr Ala Asp Pro Glu Thr Val Arg Tyr Glu Ser 
405 410 415 

ate ctg tec gaa gaa ggc ggt egg ttc tac cag gcg cag aac agg tac 1296 
He Leu Ser Glu Glu Gly Gly Arg Phe Tyr Gin Ala Gin Asn Arg Tyr 
420 425 430 

egg ate ate gag gtg cat gag gac ttc gcg gca cga. cct ccc age gac 1344 
Arg He He Glu Val His Glu Asp Phe Ala Ala Arg Pro Pro Ser Asp 
435 440 445 

ttc egg tgg atg act ttg gga cag ttg ggc gag ctg etc egg age acc 1392 
. Phe Arg Trp Met Thr Leu Gly Gin Leu Gly Glu Leu Leu Arg Ser Thr 
450 455 460 

cac ttc ttg aac ate cag gcg cgc age ttg gtc gec tec ctg cat age 1440 
His Phe Leu Asn lie Gin Ala Arg Ser Leu Val Ala Ser Leu His Ser 
465 470 475 480 

ttg tgg gcg ttg ggg cga tga 1461 
Leu Trp Ala Leu Gly Arg 
485 



<210> 26 
<211> 486 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 26 

Met Ser Ser Ser Val Glu Ala Glu Ala Ser Ala Ala Ala Pro Leu Gly 
15 10 15 

Ser Asn Asn Thr Arg Arg Phe Val Asp Ser Ala Leu Ser Ala Cys Asn 
20 25 30 

Gly Met He Pro Thr Thr Glu Phe His Cys Trp Leu Ala Asp Arg Leu 
35 40 45 

Gly Glu Asn Ser Phe Glu Thr Asn Arg He Pro Phe Asp Arg Leu Ser 
50 55 60 

Lys Trp Lys Phe Asp Ala Ser Thr Glu Asn Leu Val His Ala Asp Gly 
65 70 75 80 

Arg Phe Phe Thr Val Glu Gly Leu Gin Val Glu Thr Asn Tyr Gly Ala 
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85 90 95 

Ala Pro Ser Trp His Gin Pro He He Asn Gin Ala Glu Val Gly He 
100 105 110 

Leu Gly He Leu Val Lys Glu He Asp Gly Val Leu His Cys Leu Met 
115 120 125 

Ser Ala Lys Met Glu Pro Gly Asn Val Asn Val Leu Gin Leu Ser Pro 
130 135 140 

Thr Val Gin Ala Thr Arg Ser Asn Tyr Thr Gin Ala His Arg Gly Ser 
145 150 155 160 

Val Pro Pro Tyr Val Asp Tyr Phe Leu Gly Arg Gly Arg Gly Arg Val 
165 170 175 

Leu Val Asp Val Leu Gin Ser Glu Gin Gly Ser Trp Phe Tyr Arg Lys 
180 185 190 

Arg Asn Arg Asn Met Val Val Glu Val Gin Glu Glu Val Pro Val Leu 
195 200 205 

Pro Asp Phe Cys Trp Leu Thr Leu Gly Gin Val Leu Ala Leu Leu Arg 
210 215 220 

Gin Asp Asn He Val Asn Met Asp Thr Arg Thr Val Leu Ser Cys He 
225 230 235 240 

Pro Phe His Asp Ser Ala Thr Gly Pro Glu Leu Ala Ala Ser Glu Glu 
245 250 255 

Pro Phe Arg Gin Ala Val Ala Arg Ser Leu Ser His Gly He Asp Ser 
260 265 270 

Ser Ser He Ser Glu Ala Val Gly Trp Phe Glu Glu Ala Lys Ala Arg 
275 280 285 

Tyr Arg Leu Arg Ala Thr Arg Val Pro Leu Ser Arg Val Asp Lys Trp 
290 295 300 

Tyr Arg Thr Asp Thr Glu He Ala His Gin Asp Gly Lys Tyr Phe Ala 
305 310 315 320 

Val He Ala Val Ser Val Ser Ala Thr Asn Arg Glu Val Ala Ser Trp 
325 330 335 

Thr Gin Pro Met He Glu Pro Arg Glu Gin Gly Glu He Ala Leu Leu 
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340 345 350 

Val Lys Arg He Gly Gly Val Leu His Gly Leu Val His Ala Arg Val 
355 360 365 

Glu Ala Gly Tyr Lys Trp Thr Ala Glu He Ala Pro Thr Val Gin Cys 
370 375 380 

Ser Val Ala Asn Tyr Gin Ser Thr Pro Ser Asn Asp Trp Pro Pro Phe 
385 390 395 400 

Leu Asp Asp Val Leu Thr Ala Asp Pro Glu Thr Val Arg Tyr Glu Ser 
405 410 415 

He Leu Ser Glu Glu Gly Gly Arg Phe Tyr Gin Ala Gin Asn Arg Tyr 
420 425 430 

Arg He He Glu Val His Glu Asp Phe Ala Ala Arg Pro Pro Ser Asp 
435 440 445 

Phe Arg Trp Met Thr Leu Gly Gin Leu Gly Glu Leu Leu Arg Ser Thr 
450 455 460 

His Phe Leu Asn He Gin Ala Arg Ser Leu Val Ala Ser Leu His Ser 
465 470 475 480 

Leu Trp Ala Leu Gly Arg 
485 



<210> 27 
<211> 524 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (438) 

<223> ORFll; Thioesterase 

<400> 27 

gtg age aac gtg tgg ccg gaa aca 

Val Ser Asn Val Trp Pro Glu Thr 

1 5 

tea teg ctg ttg cgt cga etc gga 
Ser Ser Leu Leu Arg Arg Leu Gly 



tgg acg ccg ggg ttt ggc agg tgt 48 

Trp Thr Pro Gly Phe Gly Arg Cys 
10 15 

ttc cgc cgt gac egg gac gat gee 96 

Phe Arg Arg Asp Arg Asp Asp Ala 
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20 25 30 

agg cga gtc ccg aag tea gat tct tgt cca gaa teg tec aat ggg gtg 144 
Arg Arg Val Pro Lys Ser Asp Ser Cys Pro Glu Ser Ser Asn Gly Val 
35 40 45 

ttg ate tec cca gag gtt tgc get cca ace gat ttc cga cga gga teg 192 
Leu lie Ser Pro Glu Val Cys Ala Pro Thr Asp Phe Arg Arg Gly Ser 
50 55 60 

tgg cgc ccg ctg age aac gac tac cgt gcg gtc gag aca tac cgc tgt 240 
Trp Arg Pro Leu Ser Asn Asp Tyr Arg Ala Val Glu Thr Tyr Arg Cys 
65 70 75 80 

gcg cca gga gcg aag gtg ggt tgc ccg ate acc gtg ctg gtg gta gat 288 
Ala Pro Gly Ala Lys Val Gly Cys Pro lie Thr Val Leu Val Val Asp 
85 90 95 

gec gag ccg aag gtc acc ttg gat gag gcg gaa gee tgg cga gag cac 336 
Ala Glu Pro Lys Val Thr Leu Asp Glu Ala Glu Ala Trp Arg Glu His 
100 105 110 

acc gag gee gtg gee gac gtc cgt gtc ttc tec ggc ggg cat ttc ttc 384 
Thr Glu Ala Val Ala Asp Val Arg Val Phe Ser Gly Gly His Phe Phe 
115 120 125 

atg acc gaa cgc cag gac gag gtg etc gcg gtc ctt acg ggc gga teg 432 
Met Thr Glu Arg Gin Asp Glu Val Leu Ala Val Leu Thr Gly Gly Ser 
130 135 140 

ctt cga tgatcctcgc caggccgctg gaccagaccg cgacgcccct gggagcegge 488 

Leu Arg 

145 

gtgeacateg teaeggcagt gagggattgg gcatga 524 



<210> 28 
<211> 146 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 28 

Val Ser Asn Val Trp Pro Glu Thr Trp Thr Pro Gly Phe Gly Arg Cys 
1 5 10 15 

Ser Ser Leu Leu Arg Arg Leu Gly Phe Arg Arg Asp Arg Asp Asp Ala 
20 25 30 
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Arg Arg Val Pro Lys Ser Asp Ser Cys Pro Glu Ser Ser Asn Gly Val 
35 40 45 

Leu He Ser Pro Glu Val Cys Ala Pro Thr Asp Phe Arg Arg Gly Ser 
50 55 60 

Trp Arg Pro Leu Ser Asn Asp Tyr Arg Ala Val Glu Thr Tyr Arg Cys 
65 70 75 80 

Ala Pro Gly Ala Lys Val Gly Cys Pro He Thr Val Leu Val Val Asp 
85 90 95 

Ala Glu Pro Lys Val Thr Leu Asp Glu Ala Glu Ala Trp Arg Glu His 
100 105 110 

Thr Glu Ala Val Ala Asp Val Arg Val Phe Ser Gly Gly His Phe Phe 
115 120 125 

Met Thr Glu Arg Gin Asp Glu Val Leu Ala Val Leu Thr Gly Gly Ser 
130 135 140 

Leu Arg 
145 



<210> 29 
<211> 1320 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1317) 

<223> ORF12; Glycosyltransf erase 

<400> 29 

atg cgt gtc ctg ttc acq ccg ctg ccg gcg agt teg cac ttc ttc aac 4 8 
Met Arg Val Leu Phe Thr Pro Leu Pro Ala Ser Ser His Phe Phe Asn 
15 10 15 

ctg gtg ccg ttg gcg tgg gcg ttg cgt gec gcg ggg cac gag gtc cgt 96 
Leu Val Pro Leu Ala Trp Ala Leu Arg Ala Ala Gly His Glu Val Arg 
20 25 30 

gtc gec ate tgc ccg aat atg gtg teg atg gtc acc gga gca gga etc 144 
Val Ala He Cys Pro Asn Met Val Ser Met Val Thr Gly Ala Gly Leu 
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35 40 45 

acc gcg gtt ccc gtc ggc gac gag etc gac etc ate tec ttg gcg gec 192 
Thr Ala Val Pro Val Gly Asp Glu Leu Asp Leu lie Ser Leu Ala Ala 
50 55 60 

aag aac gaa etc gtt etc ggc age ggg gtc teg ttc gac gag aag ggg 24 0 
Lys Asn Glu Leu Val Leu Gly Ser Gly Val Ser Phe Asp Glu Lys Gly 
65 70 75 80 

egg cat ccg gaa etc ttc gac gag ctg ctg tea ate aac tec ggc aga 288 
Arg His Pro Glu Leu Phe- Asp Glu Leu Leu Ser He Asn Ser Gly Arg 
85 90 95 

gac acg gac gee gtg gag caa etc cac ctt gtg gat gac cga teg ctg 336 
Asp Thr Asp Ala Val Glu Gin Leu His Leu Val Asp Asp Arg Ser Leu 
100 105 110 

gac gat etc atg ggg ttc gee gag aaa tgg cag cct gat etc gtt gtg 384 
Asp Asp Leu Met Gly Phe Ala Glu Lys Trp Gin Pro Asp Leu Val Val 
115 120 125 

tgg gac get atg gtg tgt teg ggg cca gtt gtg gcg cga gcg etc ggc 432 
Trp Asp Ala Met Val Cys Ser Gly Pro Val Val Ala Arg Ala Leu Gly 
130 135 140 

gca cga cac gtg egg atg etc gtc gee etc gat gtg teg ggg tgg ctg 480 
Ala Arg His Val Arg Met Leu Val Ala Leu Asp Val Ser Gly Trp Leu 
145 150 155 160 

egg tec ggt ttc etc gaa tac cag gaa teg aag ccg cct gag cag cgc 528 
Arg Ser Gly Phe Leu Glu Tyr Gin Glu Ser Lys Pro Pro Glu Gin Arg 
165 170 175 

gtc gac ccg etc ggg acg tgg ctg gga gcg aag etc gee aag ttc gga 576 
Val Asp Pro Leu Gly Thr Trp Leu Gly Ala Lys Leu Ala Lys Phe Gly 
180 185 190 

gee acg ttc gat gaa gag ate gtg acg ggc caa gcg acc ata gat ccg 624 
Ala Thr Phe Asp Glu Glu He Val Thr Gly Gin Ala Thr He Asp Pro 
195 200 205 

att cca tec tgg atg cgc ctg cct gtg gac ttg gac tac ate teg atg 672 
He Pro Ser Trp Met Arg Leu Pro Val Asp Leu Asp Tyr He Ser Met 
210 215 220 

cgt ttc gtg ccg tac aac ggt ccg gcg gtg ttg ccg gag tgg ttg cgc 720 
Arg. Phe Val Pro Tyr Asn Gly Pro Ala Val Leu Pro Glu Trp Leu Arg 
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225 230 235 240 

gaa cga ccg acg aag ccg cgc gtc tgc ate acg cgc ggg ctg acc aag 768 
Glu Arg Pro Thr Lys Pro Arg Val' Cys lie Thr Arg Gly Leu Thr Lys 
245 250 255 

egg egg ctg age agg gtg acc gaa cag tac ggg gag caa agt gac cag 816 
Arg Arg Leu Ser Arg Val Thr Glu Gin Tyr Gly Glu Gin Ser Asp Gin 
260 265 270 

gaa caa gca atg gtg gaa agg ttg ttg cgc ggc gcg gec agg etc gac 864 
Glu Gin Ala Met Val Glu Arg Leu Leu Arg Gly Ala Ala Arg Leu Asp 
275 280 285 

gtc gag gtg ate gec acc ttg tct gac gac gaa gta egg gag atg ggg 912 
Val Glu Val lie Ala Thr Leu Ser Asp Asp Glu Val Arg Glu Met Gly 
290 295 300 

gag ttg ccc teg aac gtc egg gtc cac gaa tac gta ccg etc aac gaa 960 
Glu Leu Pro Ser Asn Val Arg Val His Glu Tyr Val Pro Leu Asn Glu 
305 310 315 320 

ctg ctg gag teg tgt tea gtg ate ate cat cat ggc teg acg acg acg 1008 
Leu Leu Glu Ser Cys Ser Val lie lie His His Gly Ser Thr Thr Thr 
325 330 335 

cag gaa acc gee acg gtc aac ggc gta ccg cag ttg att etc cct ggg 1056 
Gin Glu Thr Ala Thr Val Asn Gly Val Pro Gin Leu lie Leu Pro Gly 
340 345 350 

acc ttc tgg gac gaa tct cgt agg gcg gag etc eta gee gat egg gga 1104 
Thr Phe Trp Asp Glu Ser Arg Arg Ala Glu Leu Leu Ala Asp Arg Gly 
355 360 365 

gec ggt ctg gtc etc gac ccc gcg acg ttt acc gaa gac gac gtg cga 1152 
Ala Gly Leu Val Leu Asp Pro Ala Thr Phe Thr Glu Asp Asp Val Arg 
370 375 380 

ggt cag ctg gee cgc ctg etc gac gag ccg teg ttc get gec aat gcg 1200 
Gly Gin Leu Ala Arg Leu Leu Asp Glu Pro Ser Phe Ala Ala Asn Ala 
385 390 395 400 

gcg ctg ate cgc cgt gaa ate gag gaa agt ccc age ccg cac gac ate 1248 
Ala Leu He Arg Arg Glu He Glu Glu Ser Pro Ser Pro His Asp He 
405 410 415 

gtt cca cgt ctg gaa aag eta gtt gee gaa cgt gag aac cgc cgc act 1296 
Val Pro Arg Leu Glu Lys Leu Val Ala Glu Arg Glu Asn Arg Arg Thr 
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420 



425 



430 



ggg cag tct gat ggc cat ccg tga 
Gly Gin Ser Asp Gly His Pro 
435 - 



1320 



<210> 30 
<211> 439 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 30 

Met Arg Val Leu Phe Thr Pro Leu Pro Ala Ser Ser His Phe Phe Asn 
15 10 15 

Leu Val Pro Leu Ala Trp Ala Leu Arg Ala Ala Gly His Glu Val Arg 
20 25 30 

Val Ala lie Cys Pro Asn Met Val Ser Met Val Thr Gly Ala Gly Leu 
35 40 45 

Thr Ala Val Pro Val Gly Asp Glu Leu Asp Leu lie Ser Leu Ala Ala 
50 55 60 

Lys Asn Glu Leu Val Leu Gly Ser Gly Val Ser Phe Asp Glu Lys Gly 
65 70 75 80 

Arg His Pro Glu Leu Phe Asp Glu Leu Leu Ser lie Asn Ser Gly Arg 
85 90 95 

Asp Thr Asp Ala Val Glu Gin Leu His Leu Val Asp Asp Arg Ser Leu 
100 105 110 

Asp Asp Leu Met Gly Phe Ala Glu Lys Trp Gin Pro Asp Leu Val Val 
115 120 125 

Trp Asp Ala Met Val Cys Ser Gly Pro Val Val Ala Arg Ala Leu Gly 
130 135 140 

Ala Arg His Val Arg Met Leu Val Ala Leu Asp Val Ser Gly Trp Leu 
145 150 155 160 

Arg Ser Gly Phe Leu Glu Tyr Gin Glu Ser Lys Pro Pro Glu Gin Arg 
165 170 175 



Val Asp Pro Leu Gly Thr Trp Leu Gly Ala Lys Leu Ala Lys Phe Gly 
180 185 190 
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Ala Thr Phe Asp Glu Glu lie Val Thr Gly Gin Ala Thr lie Asp Pro 
195 200 205 

lie Pro Ser Trp Met Arg Leu Pro Val Asp Leu Asp Tyr He Ser Met 
210 215 220 

Arg Phe Val Pro Tyr Asn Gly Pro Ala Val Leu Pro Glu Trp Leu Arg 
225 230 235 240 

Glu Arg Pro Thr Lys Pro Arg Val Cys He Thr Arg Gly Leu Thr Lys 
245 250 255 

Arg Arg Leu Ser Arg Val Thr Glu Gin Tyr Gly Glu Gin Ser Asp Gin 
260 265 270 

Glu Gin Ala Met Val Glu Arg Leu Leu Arg Gly Ala Ala Arg Leu Asp 
275 280 285 

Val Glu Val He Ala Thr Leu Ser Asp Asp Glu Val Arg Glu Met Gly 
290 295 300 

Glu Leu Pro Ser Asn Val Arg Val His Glu Tyr Val Pro Leu Asn Glu 
305 310 315 320 

Leu Leu Glu Ser Cys Ser Val He He His His Gly Ser Thr Thr Thr 
325 330 335 

Gin Glu Thr Ala Thr Val Asn Gly Val Pro Gin Leu He Leu Pro Gly 
340 345 350 

Thr Phe Trp Asp Glu Ser Arg Arg Ala Glu Leu Leu Ala Asp Arg Gly 
355 360 365 

Ala Gly Leu Val Leu Asp Pro Ala Thr Phe Thr Glu Asp Asp Val Arg 
370 375 380 

Gly Gin Leu Ala Arg Leu Leu Asp Glu Pro Ser Phe Ala Ala Asn Ala 
385 390 395 400 

Ala Leu He Arg Arg Glu He Glu Glu Ser Pro Ser Pro His Asp He 
405 410 415 

Val Pro Arg Leu Glu Lys Leu Val Ala Glu Arg Glu Asn Arg Arg Thr 
420 425 430 

Gly Gin Ser Asp Gly His Pro 
435 
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<210> 31 
<211> 1389 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1386) 

<223> ORF13; 3 , 4 -Dehydratase 

<400> 31 

atg cag age egg aaa acc aga gcg ctg ggg aaa ggg cgc gec aga gtg 48 
Met. Gin' Ser Arg Lys Thr Arg Ala Leu Gly Lys Gly Arg Ala Arg Val 
15 10 15 

act teg tgt gac gac act tgc get acc get act gag atg gtg ccg gat 96 
Thr Ser Cys Asp Asp Thr Cys Ala Thr Ala Thr Glu Met Val Pro Asp 
20 25 30 

gec aag gac egg ata ttg gca tec gta cgc gat tac cac cgc gaa cag 144 
Ala Lys Asp Arg lie Leu Ala Ser Val Arg Asp Tyr His Arg Glu Gin 
35 40 45 

gaa tec ccg acc ttc gtg get gga teg acg ccg ate egg cca teg ggc 192 
Glu Ser Pro Thr Phe Val Ala Gly Ser Thr Pro lie Arg Pro Ser Gly 
50 55 60 

gee gtg etc gac gag gac gac egg gtg gca ctg gtg gaa gee gcg ctg 240 
Ala Val Leu Asp Glu Asp Asp Arg Val Ala Leu Val Glu Ala Ala Leu 
65 70 75 80 

gag etc egg ate gee gcg ggc ggg aat gca egg cga ttc gag age gag 288 
Glu Leu Arg lie Ala Ala Gly Gly Asn Ala Arg Arg Phe Glu Ser Glu 
85 90 95 

ttc gec cgc ttc ttc ggc etc cgc aag get cat etc gtc aac tec ggt 336 
Phe Ala Arg Phe Phe Gly Leu Arg Lys Ala His Leu Val Asn Ser Gly 
100 105 110 

teg teg gec aat etc ctg gca ctg agt teg ctt acc tec ccc aaa etc 384 
Ser Ser Ala Asn Leu Leu Ala Leu Ser Ser Leu Thr Ser Pro Lys Leu 
115 120 125 

ggc gag gca cga ctg egg ccc ggc gac' gaa gtg ate act gcg gcg gtc 432 
Gly Glu Ala Arg Leu Arg Pro Gly Asp Glu Val lie Thr Ala Ala Val 
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130 135 140 

ggc ttc ccc acg acg ate aat ccg gcg gtc caa aac gga etc gtc ccg 480 

Gly Phe Pro Thr Thr He Asn Pro Ala Val Gin Asn Gly Leu Val Pro 

145 150 155 160 

gta ttc gtc gac gtg gaa ctg ggc acc tac aac gca acg cca gac cgc 

Val Phe Val Asp Val Glu Leu Gly Thr Tyr Asn Ala Thr Pro Asp Arg 
165 170 175 

ate aag gec gec gtc acg gaa egg acg cga gec atc'atg ctg gcg cac 576 

He Lys Ala Ala Val Thr Glu Arg Thr Arg Ala He Met Leu Ala His 
180 185 190 

acc ctg ggc aac ccc ttc gee get gac gaa ate gcg gag ate gca aaa 624 

Thr Leu Gly. Asn Pro Phe Ala Ala Asp Glu He Ala Glu He Ala Lys 
195 200 205 

gaa cac gag ctg ttc etc gtc gaa gac aac tgt gat gcg gtg gga tec 672 

Glu His Glu Leu Phe Leu Val Glu Asp Asn Cys Asp Ala Val Gly Ser 
210 215 220 



720 



768 



acc tac egg gga egg ctg acc gga acc ttc ggc gac ctg aca acg gtc 

Thr Tyr Arg Gly Arg Leu Thr Gly Thr Phe Gly Asp Leu Thr Thr Val 

225 230 235 240 

age ttc tat cct gee cat cac ate acc age ggc gag ggt ggc tgc gtg 

Ser Phe Tyr Pro Ala His His He Thr Ser Gly Glu Gly Gly Cys Val 
245 250 255 

ttg acc ggc age ctg gaa ttg get cgc ate ate gag teg ctg cgt gac 816 

Leu Thr Gly Ser Leu Glu Leu Ala Arg He He Glu Ser Leu Arg Asp 

260 265 270 

tgg gga egg gat tgc tgg tgc gag ccc ggc gtg gac aac acc tgc cgc 864 

Trp Gly Arg Asp Cys Trp Cys Glu Pro Gly Val Asp Asn Thr Cys Arg 

275 280 285 

aag agg ttc gac tac cac etc ggt acc ctt cca ccg ggc tac gac cac 912 

Lys Arg Phe Asp Tyr His Leu Gly Thr Leu Pro Pro Gly Tyr Asp His 

290 295 300 

aag tac acg ttc tec cac gtc ggt tac aac etc aag acc acc gac ctg 960 

Lys Tyr Thr Phe Ser His Val Gly Tyr Asn Leu Lys Thr Thr Asp Leu 

305 310 315 320 

cag gec gca ctt gcg ctg age cag ttg age aag att tec gca ttc ggg 1008 

Gin Ala Ala Leu Ala Leu Ser Gin Leu Ser Lys He Ser Ala Phe Gly 
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325 330 335 

teg gca cgc cgc cgt aac tgg cga egg ttg cgc gaa ggg ctg tec ggg 1056 
Ser Ala Arg Arg Arg Asn Trp Arg Arg Leu Arg Glu Gly Leu Ser Gly 
340 345 350 



ttg ccg ggc ctg ctg- ctg ccg gta gee aca ccg cac age gac ccg age 1104 
Leu Pro Gly Leu Leu Leu Pro Val Ala Thr Pro His Ser Asp Pro Ser 
355 360 365 

tgg ttc ggg ttt gcg ate acc ate agt gcg gac gee ggg ttc acc cgt 1152 
Trp Phe Gly Phe Ala lie Thr lie Ser Ala Asp Ala Gly Phe Thr Arg 
370 375 380 

gec gee ctg gtg aac. ttc ctg gaa tec cgc aac ate ggc acc cga ctg 1200 
Ala Ala Leu Val Asn Phe Leu Glu Ser Arg Asn lie Gly Thr Arg Leu 
385 390 395 400 

ctg ttc ggc ggt aac ate acc egg cac ccg gee ttc gag cag gtg egg 124 8 
Leu Phe Gly Gly Asn lie Thr Arg His Pro Ala Phe Glu Gin Val Arg 
405 410 415 

tac egg ate gee gac gcg etc acc aac age gac ate gtc acc gac cga 1296 
Tyr Arg He Ala Asp Ala Leu Thr Asn Ser Asp He Val Thr Asp Arg 
420 425 430 

acc ttc tgg gtc ggc gtc tac cca ggc ata acg gac caa atg ate gac 1344 
Thr Phe Trp Val Gly Val Tyr Pro Gly He Thr Asp Gin Met He Asp 
435 440 445 

tac gtc gtc gaa tea ate get gaa ttc gtg gee aag agt tec tag 1389 
Tyr Val Val ' Glu Ser He Ala Glu Phe Val Ala Lys Ser Ser 
450 455 460 



<210> 32 
<211> 462 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 32 

Met Gin Ser Arg Lys Thr Arg Ala Leu Gly Lys Gly Arg Ala Arg Val 
1 5 * 10 15 

Thr Ser Cys Asp Asp Thr Cys Ala Thr Ala Thr Glu Met Val Pro Asp 
20 25 30 

Ala Lys Asp Arg He Leu Ala Ser Val Arg Asp Tyr His Arg Glu Gin 
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35 



40 



45 



Glu Ser Pro Thr Phe Val Ala Gly Ser Thr Pro He Arg Pro Ser Gly 
50 55 60 

Ala Val Leu Asp Glu Asp Asp Arg Val Ala Leu Val Glu Ala Ala Leu 
65 70 75 80 

Glu Leu Arg He Ala Ala Gly Gly Asn Ala Arg Arg Phe Glu Ser Glu 
85 90 95 

Phe Ala Arg Phe Phe Gly Leu Arg Lys Ala His Leu Val Asn Ser Gly 
100 105 110 

Ser Ser Ala Asn Leu Leu Ala Leu Ser Ser Leu Thr Ser Pro Lys Leu 
115 120 125 

Gly Glu Ala Arg Leu Arg Pro Gly Asp Glu Val He Thr Ala Ala Val 
130 135 140 

Gly Phe Pro Thr Thr He Asn Pro Ala Val Gin Asn Gly Leu Val Pro 
145 150 155 160 

Val Phe Val Asp Val Glu Leu Gly Thr Tyr Asn Ala Thr Pro Asp Arg 
165 170 175 

He Lys Ala Ala Val Thr Glu Arg Thr Arg Ala He Met Leu Ala His 
180 185 190 

Thr Leu Gly Asn Pro Phe Ala Ala Asp Glu He Ala Glu He Ala Lys 
195 200 205 

Glu His Glu Leu Phe Leu Val Glu Asp Asn Cys Asp Ala Val Gly Ser 
210 215 220 

Thr Tyr Arg Gly Arg Leu Thr Gly Thr Phe Gly Asp Leu Thr Thr Val 
225 230 235 240 

Ser Phe Tyr Pro Ala His His He Thr Ser Gly Glu Gly Gly Cys Val 
245 250 255 

Leu Thr Gly Ser Leu Glu Leu Ala Arg He He Glu Ser Leu Arg Asp 
260 265 270 

Trp Gly Arg Asp Cys Trp Cys Glu Pro Gly Val Asp Asn Thr Cys Arg 
275 2B0 285 



Lys Arg Phe Asp Tyr His Leu Gly Thr Leu Pro Pro Gly Tyr Asp His 
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290 295 300 

Lys Tyr Thr Phe Ser His Val Gly Tyr Asn Leu Lys Thr Thr Asp Leu 
305 310 315 320 

Gin Ala Ala Leu Ala Leu Ser Gin Leu Ser Lys lie Ser Ala Phe Gly 
325 330 335 

Ser Ala Arg Arg Arg Asn Trp Arg Arg Leu. Arg Glu Gly Leu Ser Gly 
340 345 350 

Leu Pro Gly Leu Leu Leu Pro Val Ala Thr Pro His Ser Asp Pro Ser 
355 360 • 365 

Trp Phe Gly Phe Ala lie Thr lie Ser Ala Asp Ala Gly Phe Thr Arg 
370 375 380 

Ala Ala Leu Val Asn Phe Leu Glu Ser Arg Asn He Gly Thr Arg Leu 
385 390 395 400 

Leu Phe Gly Gly Asn He Thr Arg His Pro Ala Phe Glu Gin Val Arg 
405 410 415 

Tyr Arg He Ala Asp Ala Leu Thr Asn Ser Asp He Val Thr Asp Arg 
420 425 430 

Thr Phe Trp Val Gly Val Tyr Pro Gly He Thr Asp Gin Met He Asp 
435 440 445 

Tyr Val Val Glu Ser He Ala Glu Phe Val Ala Lys Ser Ser 
450 455 460 



<210> 33 
<211> 1158 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (1155) 

<223> ORF14; 4 -Aminotransferase 

<400> 33 

gtg ate aac ctg cac cag ccg ate etc ggc ace gaa gaa etc gac gcg 48 
Val He Asn Leu His Gin Pro He Leu Gly Thr Glu Glu Leu Asp Ala 
15 10 15 
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ate gcg gag gtg ttc gec tec aac tgg ate ggg etc ggg ccg cgc ace 96 
He Ala Glu Val Phe Ala Ser Asn Trp He Gly Leu Gly Pro Arg Thr 
20 25 30 

egg acg ttc gag gee gaa ttc gee cac cac ctg gga gtg gat ccc gaa 144 
Arg Thr Phe Glu Ala Glu Phe Ala His His Leu Gly Val Asp Pro Glu 
35 40 45 

cag gtc gtg ttc etc aac teg ggg act gec gcg ctg ttc ctt acc gtg 192 
Gin Val Val Phe Leu Asn Ser Gly Thr Ala Ala Leu Phe Leu Thr Val 
50 55 60 

cag gtg etc gac etc ggc cca ggc gac gac gtg gta ctt cct teg ata 240 
Gin Val Leu Asp Leu Gly Pro Gly Asp Asp Val Val Leu Pro Ser He 
65 70 75 80 

age ttc gtg gcg gcg gee aac gee ate gca tec tec ggt gee cgc ccg 2 88 
Ser Phe Val Ala Ala Ala Asn Ala He Ala Ser Ser Gly Ala Arg Pro 
85 90 95 

gtg ttc tgc gac gtc gac ccc egg acg ttg aac ccc acg ctg gat gat 336 
Val Phe Cys Asp Val Asp Pro Arg Thr Leu Asn Pro Thr Leu Asp Asp 
100 105 110 

gtg gcg agg gee ate acg ccg gcg acc aag gee gta ttg ctg etc cac 384 
Val Ala Arg Ala He Thr Pro Ala Thr Lys Ala Val Leu Leu Leu His 
115 120 125 

tat gga gga teg ccg gga gaa gtc acc gcg ate gee gat ttc tgc cgt 4 32 
Tyr Gly Gly Ser Pro Gly Glu Val Thr Ala He Ala Asp Phe Cys Arg 
130 135 140 

gaa aag ggc etc atg etc ate gag gac tec gee tgc gcg gtg gca teg 480 
Glu Lys Gly Leu Met Leu He Glu Asp Ser Ala Cys Ala Val Ala Ser 
145 150 155 160 

tec gtg cac ggc acc get tgc gga acc ttt ggt gac ctg gee acg tgg 528 
Ser Val His Gly Thr Ala Cys Gly Thr Phe Gly Asp Leu Ala Thr Trp 
165 170 175 

agt ttc gat gcg atg aag ate ctg gtc acc ggg gat ggg ggc atg ttc 576 
Ser Phe Asp Ala Met Lys He Leu Val Thr Gly Asp Gly Gly Met Phe 
180 185 190 

tac gcg gcg gat ccg gag ctg gcg cac cgc gca aga cga etc gec tac 624 
Tyr Ala Ala Asp Pro Glu Leu Ala His Arg Ala Arg Arg Leu Ala Tyr 
195 200 205 
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cac ggt ctt gag cag atg age gga ttc gat teg gee aag tct tec aac 672 
His Gly Leu Glu Gin Met Ser Gly Phe Asp Ser Ala Lys Ser Ser Asn 
210 215 220 

cgc tgg tgg gat att cgc gtc gaa gac ate ggc cag egg ctg ate ggg 72 0 
Arg Trp Trp Asp lie Arg Val Glu Asp He Gly Gin Arg Leu He Gly 
225 230 235 240 

aac gac atg acg gca gcg ctt ggc age gtg cag ctg cgc aaa ctg cca 768 
Asn Asp Met Thr Ala Ala Leu Gly Ser Val Gin Leu Arg Lys Leu Pro 
245 250 255 

gaa ttc ate aac agg cgt aga gaa ate get acg cag tac gac egg ttg 816 
Glu Phe He Asn Arg Arg Arg Glu He Ala Thr Gin Tyr Asp Arg Leu 
260 265 270 

ctt tec gat gtg ccg ggt gtc etc eta ccg ccg acg eta ccg gat ggg 864 
Leu Ser Asp Val Pro Gly Val Leu Leu Pro Pro Thr Leu Pro Asp Gly 
275 280 285 

cac gtc teg tea cac tac ttc tac tgg gtc cag ctg get ccg gag ate 912 
His Val Ser Ser His Tyr Phe Tyr Trp Val Gin Leu Ala Pro Glu He 
290 295 300 

cgc gac cag gtg gcg cag caa atg ctg gaa cgc ggc ate tac acg age 960 
Arg Asp Gin Val Ala Gin Gin Met Leu Glu Arg Gly He Tyr Thr Ser 
305 310 315 320 

tac cgc tac ccg ccc ctg cac aag gtc ccc ate tac cgc gcg gac tgc 1008 
Tyr Arg Tyr Pro Pro Leu His Lys Val Pro He Tyr Arg Ala Asp Cys 
325 330 335 

aag ctg cct tct gcg gag cac gee tgc cgc aga aca etc ctg eta cca 1056 
Lys Leu Pro Ser Ala Glu His Ala Cys Arg Arg Thr Leu Leu Leu Pro 
340 345 350 

ctg cac cca age ctt gac gac gee gag gtg cgc acg gtg get gac gag 1104 
Leu His Pro Ser Leu Asp Asp Ala Glu Val Arg Thr Val Ala Asp Glu 
355 360 365 

ttc cag aag gee gtc gaa cac cac ate age caa aga tea cca etc cga 1152 
Phe Gin Lys Ala Val Glu His His He Ser Gin Arg Ser Pro Leu Arg 
370 375 380 

aag tga 1158 

Lys 

385 
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<210> 34 

<211> 385 

<212> PRT 

<213> Saccharopolyspora spinosa 

<400> 34 

Val lie Asn Leu His Gin Pro He Leu Gly Thr Glu Glu Leu Asp Ala 
1 5 10 15 

He Ala Glu Val Phe Ala Ser Asn Trp He Gly Leu Gly Pro Arg Thr 
20 25 30 

Arg Thr Phe Glu Ala Glu Phe Ala His His Leu Gly Val Asp Pro Glu 
35 40 45 

Gin Val Val Phe Leu Asn Ser Gly Thr Ala Ala Leu Phe Leu Thr Val 
50 55 60 

Gin Val Leu Asp Leu Gly Pro Gly Asp Asp Val Val Leu Pro Ser He 
65 70 75 80 

Ser Phe Val Ala Ala Ala Asn Ala He Ala Ser Ser Gly Ala Arg Pro 
85 90 95 

Val Phe Cys Asp Val Asp Pro Arg Thr Leu Asn Pro Thr Leu Asp Asp 
100 105 110 

Val Ala Arg Ala He Thr Pro Ala Thr Lys Ala Val Leu Leu Leu His 
115 120 125 

Tyr Gly Gly Ser Pro Gly Glu Val Thr Ala He Ala Asp Phe Cys Arg 
130 135 140 

Glu Lys Gly Leu Met Leu He Glu Asp Ser Ala Cys Ala Val Ala Ser 
145 150 155 160 

Ser Val His Gly Thr Ala Cys Gly Thr Phe Gly Asp Leu Ala Thr Trp 
165 170 175 

Ser Phe Asp Ala Met Lys lie Leu Val Thr Gly Asp Gly Gly Met Phe 
180 185 190 

Tyr Ala Ala Asp Pro Glu Leu Ala His Arg Ala Arg Arg Leu Ala Tyr 
195 200 205 

His Gly Leu Glu Gin Met Ser Gly Phe Asp Ser Ala Lys Ser Ser Asn 
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210 215 220 

Arg Trp Trp Asp He Arg Val Glu Asp He Gly Gin Arg Leu He Gly 
225 230 235 240 

Asn Asp Met Thr Ala Ala Leu Gly Ser Val Gin Leu Arg Lys Leu Pro 
245 250 255 



Glu Phe He Asn Arg Arg Arg Glu lie Ala Thr Gin Tyr Asp Arg Leu 
260 265 270 

Leu Ser Asp Val Pro Gly Val Leu Leu Pro Pro Thr Leu Pro Asp Gly 
275 280 285 

His Val Ser Ser His Tyr Phe Tyr Trp Val Gin Leu Ala Pro Glu He 
290 295 300 



Arg Asp Gin Val Ala Gin Gin Met Leu Glu Arg Gly He Tyr Thr Ser 
305 310 315 320 

Tyr Arg Tyr Pro Pro Leu His Lys Val Pro He Tyr Arg Ala Asp Cys 
325 330 335 



Lys Leu Pro Ser Ala Glu His Ala Cys Arg Arg Thr Leu Leu Leu Pro 
340 345 350 

Leu His Pro Ser Leu Asp Asp Ala Glu Val Arg Thr Val Ala Asp Glu 
355 360 365 

Phe Gin Lys Ala Val Glu His His He Ser Gin Arg Ser Pro Leu Arg 
370 375 380 



Lys 
385 



<210> 35 
<211> 750 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (747) 

<223> ORF15; N-Dimethyltransf erase 
<400> 35 
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atg teg cgc gtg age gac aca ttc gca gaa acc tec teg gta tac age 48 
Met Ser Arg Val Ser Asp Thr Phe Ala Glu Thr Ser Ser Val Tyr Ser 
15 10 15 

cca gat cat gec gac ate tac gac gcg ate cac tec gcg cgt ggc egg 96 
Pro Asp His Ala Asp He Tyr Asp Ala He His Ser Ala Arg Gly Arg 
20 25 30 

gac tgg gca gec gag gee ggg gaa gta gtc cag etc gta cgc acc agg 144 
Asp Trp Ala Ala Glu Ala Gly Glu Val Val Gin Leu Val Arg Thr Arg 
35 40 45 

ctg ccc gaa gca cag tec eta etc gac gtc gee tgt ggg acc ggg gcg 192 
Leu Pro Glu Ala Gin Ser Leu Leu Asp Val Ala Cys Gly Thr Gly Ala 
50 55 60 

cac eta gag cga ttc cgt gee gaa tac gcg aag gtc gcg ggg ctt gaa 240 
His Leu Glu Arg Phe Arg Ala Glu Tyr Ala Lys Val Ala Gly Leu Glu 
65 70 75 80 

ctg tec gat gcg atg egg gag ate gcg ate aga cga gtc cct gag gta 2 88 
Leu Ser Asp Ala Met Arg Glu He Ala He Arg Arg Val Pro Glu Val 
85 90 95 

ccg att cac ate ggt gac ate cgc gat ttc gac etc ggc gag cca ttc 336 
Pro lie His He Gly Asp He Arg Asp Phe Asp Leu Gly Glu Pro Phe 
100 105 110 

gac gtc ate acc tgc ctg tgc ttt acc gcg get tac atg egg acc gtt 384 
Asp Val He Thr Cys Leu Cys Phe Thr Ala Ala Tyr Met Arg Thr Val 
115 120 125 

gac gac ctg cga cgc gtg acg egg aac atg gee egg cac ctg gee cct 432 
Asp Asp Leu Arg Arg Val Thr Arg Asn Met Ala Arg His Leu Ala Pro 
130 135 140 

ggt gga gtc gcg gtc ate gaa ccc tgg tgg ttt ccc gac aag ttc ate 480 
Gly Gly Val Ala Val He Glu Pro Trp Trp Phe Pro Asp Lys Phe He 
145 150 155 160 

gac ggg ttc gtc acc gga gec gtc gcg cac cac ggc gag egg gtg ate 528 
Asp Gly Phe Val Thr Gly Ala Val Ala His His Gly Glu Arg Val He 
165 170 175 

age egg eta teg cac teg gtc ctg gag ggc cgt acg age egg atg acc 576 
Ser Arg Leu Ser His Ser Val Leu Glu Gly Arg Thr Ser Arg Met Thr 
180 185 190 
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gtc cgc tac aca gtc gcc 

Val Arg Tyr Thr Val Ala 
195 

ttc gaa ate etc teg ctg 

Phe Glu lie Leu Ser Leu 
210 

gaa gac gca ggg ate cgc 

Glu Asp Ala Gly lie Arg 

225 230 

cga ggc ctg ttc gtc gga 
Arg Gly Leu Phe Val Gly 
245 



gaa ccc ace ggg ate egg 
Glu Pro Thr Gly lie Arg 
200 

ttc act gag gac gag tac 
Phe Thr Glu Asp Glu Tyr 
215 220 

gcg gaa tac ctt cct gga 
Ala Glu Tyr Leu Pro Gly 
235 

ate cgc aac tga 
lie Arg Asn 



gac ttc aca gag 624 

Asp Phe Thr Glu 

205 

acc gcc gcg etc 672 
Thr Ala Ala Leu 



gca ccg aac ggc 720 
Ala Pro Asn Gly 
240 

750 



<210> 36 
<211> 249 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 36 

Met Ser Arg Val Ser Asp Thr Phe Ala Glu Thr Ser Ser Val Tyr Ser 
15 10 15 

Pro Asp His Ala Asp He Tyr Asp Ala He His Ser Ala Arg Gly Arg 
20 25 30 



Asp Trp Ala Ala Glu Ala Gly Glu Val Val Gin Leu Val Arg Thr Arg 
35 40 45 

Leu Pro Glu Ala Gin Ser Leu Leu Asp Val Ala Cys Gly Thr Gly Ala 
50 55 60 

His Leu Glu Arg Phe Arg Ala Glu Tyr Ala Lys Val Ala Gly Leu Glu 
65 70 75 80 

Leu Ser Asp Ala Met Arg Glu He Ala He Arg Arg Val Pro Glu Val 
85 90 95 

Pro He His lie Gly Asp He Arg Asp Phe Asp Leu Gly Glu Pro Phe 
100 105 110 

Asp Val He Thr Cys Leu Cys Phe Thr Ala Ala Tyr Met Arg Thr Val 
115 120 125 

Asp Asp Leu Arg Arg Val Thr Arg Asn Met Ala Arg His Leu Ala Pro 
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130 135 140 

Gly Gly Val Ala Val He Glu Pro Trp Trp Phe Pro Asp Lys Phe He 

145 150 155 160 



Asp Gly Phe Val Thr Gly Ala Val 
165 

Ser Arg Leu Ser His Ser Val Leu 
180 

Val Arg Tyr Thr Val Ala Glu Pro 
195 200 

Phe Glu He Leu Ser Leu Phe- Thr 
210 215 

Glu Asp Ala Gly He Arg Ala Glu 
225 230 



Ala His His Gly Glu Arg Val He 
170 175 

Glu Gly Arg Thr Ser Arg Met Thr 
185 190 

Thr Gly lie Arg Asp Phe Thr Glu 
205 

Glu Asp Glu Tyr Thr Ala Ala Leu 
220 

Tyr Leu Pro Gly Ala Pro Asn Gly 
235 240 



Arg Gly Leu Phe Val Gly He Arg Asn 
245 



<210> 37 
<211> 726 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (723) 

<223> ORF16; 3 , 4 -Reduktase 



<400> 37 

atg age gaa cag acg att gca ctg gtc acc ggc gca aac aag gga ate 48 

Met Ser Glu Gin Thr He Ala Leu Val Thr Gly Ala Asn Lys Gly He 
15 10 15 



gga tac gag ate gcg gee ggg etc ggc gcg ctg ggg tgg age gtc gga 96 

Gly Tyr Glu He Ala Ala Gly Leu Gly Ala Leu Gly Trp Ser Val Gly 
20 25 30 

ate ggg gca egg gac cac cag cgc ggg gag gat gee gtg gcg aaa ttg 144 

He Gly Ala Arg Asp His Gin Arg Gly Glu Asp Ala Val Ala Lys Leu 
35 40 45 
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cgt gcg gac ggc gtc gat gcg ttc gcg gta tec ctg gac gtg aca gac 192 
Arg Ala Asp Gly Val Asp Ala Phe Ala Val Ser Leu Asp Val Thr Asp 
5.0 55 60 

gac gcg age gtc gcg get get gcg get ctg etc gag gag cgc gec ggc 240 
Asp Ala Ser Val Ala Ala Ala Ala Ala Leu Leu Glu Glu Arg Ala Gly 
65 70 75 80 

egg etc gat gtg ctg gtt aat aac gee ggc ate gee ggg gca tgg ccg 2 88 
Arg Leu Asp Val Leu Val Asn Asn Ala Gly lie Ala Gly Ala Trp Pro 
85 90 95 

gag gag ccc teg ace gtc aca ccg gcg age etc egg gcg gtg gtg gag 3 36 
Glu Glu Pro Ser Thr Val Thr Pro Ala Ser Leu Arg Ala Val Val Glu 
100 105 110 

ace aac gtg ate ggc gtc gtt egg gtt ace aac get atg ctg ccg ttg 3 84 
Thr Asn Val lie Gly Val Val Arg Val Thr Asn Ala Met Leu Pro Leu 
115 120 _ . 125 

eta cgc cgc tec gag cgc ccg egg ate gtc aac cag tec age cac gtc 4 32 
Leu Arg Arg Ser Glu Arg Pro Arg lie Val Asn Gin Ser Ser His Val 
130 135 140 

get tec ctg ace ttg caa ace acg ccg ggc gtc gac etc ggc ggg ate 480 
Ala Ser Leu Thr Leu Gin Thr Thr Pro Gly Val Asp Leu Gly Gly He 
145 150 155 160 

age gga gee tac tea ccg teg aag acg ttc etc aac gcg ate ace ate 52 8 
Ser Gly Ala Tyr Ser Pro Ser Lys Thr Phe Leu Asn Ala He Thr He 
165 170 175 

cag tac gee aag gaa etc age gat ace aac ate aaa ate aac aac gee 576 
Gin Tyr Ala Lys Glu Leu Ser Asp Thr Asn He Lys He Asn Asn Ala 
180 185 190 

tgc ccc ggc tac gtc gcg ace gac ctt aac ggc ttc cac gga ace age 624 
Cys Pro Gly Tyr Val Ala Thr Asp Leu Asn Gly Phe His Gly Thr Ser 
195 200 205 

acg ccg gca gac ggt gee agg ate gee att egg etc gec acg ctg cca 672 
Thr Pro Ala Asp Gly Ala Arg He Ala He Arg Leu Ala Thr Leu Pro 
210 215 220 

gac gac ggc ccg acc gga ggc atg ttc gac gac gee ggg aat gtg ccc 720 
Asp Asp Gly Pro Thr Gly Gly Met Phe Asp Asp Ala Gly Asn. Val Pro 
225 230 235 240 
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tgg tga 726 
Trp 



<210> 38 
<211> 241 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 38 

Met Ser Glu Gin Thr lie Ala Leu Val Thr Gly Ala Asn Lys Gly He 
1- 5 10 15 

Gly Tyr Glu He Ala Ala Gly Leu Gly Ala Leu Gly Trp Ser Val Gly 
20 25 30 

He Gly Ala Arg Asp His Gin Arg Gly Glu Asp Ala Val Ala Lys Leu 
35 40 45 

Arg Ala Asp Gly Val Asp Ala Phe Ala Val Ser Leu Asp Val Thr Asp 
50 55 60 

Asp Ala Ser Val Ala Ala Ala Ala Ala Leu Leu Glu Glu Arg Ala Gly 
65- 70 75 80 

Arg Leu Asp Val Leu Val Asn Asn Ala Gly He Ala Gly Ala Trp Pro 
85 90 95 

Glu Glu Pro Ser Thr Val Thr Pro Ala Ser Leu Arg Ala Val Val Glu 
100 105 HO 

Thr Asn Val He Gly Val Val Arg Val Thr Asn Ala Met Leu Pro Leu 
115 120 125 

Leu Arg Arg Ser Glu Arg Pro Arg He Val Asn Gin Ser Ser His Val 
130 135 140 

Ala Ser Leu Thr Leu Gin Thr Thr Pro Gly Val Asp Leu Gly Gly He 
145 150 155 160 

Ser Gly Ala Tyr Ser Pro Ser Lys Thr Phe Leu Asn Ala He Thr He 
165 170 175 

Gin Tyr Ala Lys Glu Leu Ser Asp Thr Asn He Lys He Asn Asn Ala 
180 185 190 

Cys Pro Gly Tyr Val Ala Thr Asp Leu Asn Gly Phe His Gly Thr Ser 
195 200 205 
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Thr Pro Ala Asp Gly Ala Arg He Ala He Arg Leu Ala Thr Leu Pro 
210 215 220 

Asp Asp Gly Pro Thr Gly Gly Met Phe Asp Asp Ala Gly Asn Val Pro 
225 230 235 240 

Trp 



<210> 39 
<211> 837 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (834) 

<223> ORF17; Transkriptions -Regulator 
<400> 39 

atg gag acg egg gag ttg egg tac ttc gtt gca gtc gec gag gag ttg 4 8 
Met Glu Thr Arg Glu Leu Arg Tyr Phe Val Ala Val Ala Glu Glu Leu 
15 10 15 

cac ttc ggc egg gec gec cag cgc ctg ggc ate gee cag ccg ccg ctg 96 
His Phe Gly Arg Ala Ala Gin Arg Leu Gly He Ala Gin Pro Pro Leu 
20 25 30 

teg egg acg ate gec cag etc gag caa cga etc gga gtc gtg ttg ctg 144 
Ser Arg Thr He Ala Gin Leu Glu Gin Arg Leu Gly Val Val Leu Leu 
35 40 45 

caa cgc ace age cgc aaa gtc teg etc acc gaa gec ggg gca atg ctg 192 
Gin Arg Thr Ser Arg Lys Val Ser Leu Thr Glu Ala Gly Ala Met Leu 
50 55 60 

ctg acc gaa ggc egg gcg ate etc ggc gcg ctg gca gca gee gag cga 240 
Leu Thr Glu Gly Arg Ala He Leu Gly Ala Leu Ala Ala Ala Glu Arg 
65 70 75 80 

cgc acc cag cgt gec gcg acg age cag ccc teg eta gtc ctg get gec 288 
Arg Thr Gin Arg Ala Ala Thr Ser Gin Pro Ser Leu Val Leu Ala Ala 
85 90 95 

aag gec ggc gee tec ggt gag ctg ctg gcg aag ttg etc gac gcg tac 336 
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Lys Ala Gly Ala Ser Gly Glu Leu Leu Ala Lys Leu Leu Asp Ala Tyr 
100 105 110 

gcc gcc gag ccg gga gcc gtg gcc gtc gac ctg ctg etc tgc gaa tec 384 
Ala Ala Glu Pro Gly Ala Val Ala Val Asp Leu Leu Leu Cys Glu Ser 
115 120 125 

cag ccc cag aaa acg ctg cat gac ggc egg gcc gac gtg gcg ctg ttg 432 
Gin Pro Gin Lys Thr Leu His Asp Gly Arg Ala Asp Val Ala Leu Leu 
130 135 140 



cat caa ccc ttc gac ccg acg gcc gaa etc gac ate gaa att ctg aac 4 80 
His Gin Pro Phe Asp Pro Thr Ala Glu Leu Asp He Glu He Leu Asn 
145 150 155 160 

acc gag caa caa gtc gcc att ctt ccg ace teg cat ccg ctt gcc age 528 
Thr Glu Gin Gin Val' Ala He Leu Pro Thr Ser His Pro Leu Ala Ser 
165 170 175 

gag ccc cat gta egg atg gcg gat gtc age tea ctg ccg gat etc ccg 576 
Glu Pro His Val Arg Met Ala Asp Val Ser Ser Leu Pro Asp Leu Pro 
180 185 190 

ctt gcg -cgc tgg ccc ggc ccc gac ggc gtc tat cca gat ggc ccc ggc 624 
Leu Ala Arg Trp Pro Gly Pro Asp Gly Val Tyr Pro Asp Gly Pro Gly 
195 200 205 



gtg gaa gta cgc aac cag acg caa ctg ttc caa atg ate gca etc ggc 672 

Val Glu Val Arg Asn Gin Thr Gin Leu Phe Gin Met He Ala Leu Gly 
210 215 220 

cgc act acc gtg gtc atg ccc gaa tec agt cgc gtc aac ctg etc gaa 720 

Arg Thr Thr Val Val Met Pro Glu Ser Ser Arg Val Asn Leu Leu Glu 

225 230 235 240 



ggc etc gcc gcc gta ccg gtt eta gac gcg ccg gac gtg acg aca gtc 768 
Gly Leu Ala Ala Val Pro Val Leu Asp Ala Pro Asp Val Thr Thr Val 
245 250 255 



ate gcc tgg ccg ccc cac age cgc tec cga gca etc gcc ggc ttg gtc 816 
He Ala Trp Pro Pro His Ser Arg Ser Arg Ala Leu Ala Gly Leu Val 
260 265 270 



cgc gtg gcc aca etc etc taa 
Arg Val Ala Thr Leu Leu 
275 



837 
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<210> 40 
<211> 278 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 40 

Met Glu Thr Arg Glu Leu Arg Tyr Phe Val Ala Val Ala Glu Glu Leu 
15 10 15 

His Phe Gly Arg Ala Ala Gin Arg Leu Gly lie Ala Gin Pro Pro Leu 
20 25 30 

Ser Arg Thr lie Ala Gin Leu Glu Gin Arg Leu Gly Val Val Leu Leu 
35 40 45 

Gin Arg Thr Ser Arg Lys Val Ser Leu Thr Glu Ala Gly Ala Met Leu 
• 50 55 60 

Leu Thr Glu Gly Arg Ala lie Leu Gly Ala Leu Ala Ala Ala Glu Arg 
65 70 75 80 

Arg Thr Gin Arg Ala Ala Thr Ser Gin Pro Ser Leu Val Leu Ala Ala 
85 90 95 

Lys Ala Gly Ala Ser Gly Glu Leu Leu Ala Lys Leu Leu Asp Ala Tyr 
100 105 110 

Ala Ala Glu Pro Gly Ala Val Ala Val Asp Leu Leu Leu Cys Glu Ser 
115 120 125 

Gin Pro Gin Lys Thr Leu His Asp Gly Arg Ala Asp Val Ala Leu Leu 
130 135 140 

His Gin Pro Phe Asp Pro Thr Ala Glu Leu Asp lie Glu lie Leu Asn 
145 150 155 160 

Thr Glu Gin Gin Val Ala lie Leu Pro Thr Ser His Pro Leu Ala Ser 
165 170 175 

Glu Pro His Val Arg Met Ala Asp Val Ser Ser Leu Pro Asp Leu Pro 
180 185 190 

Leu Ala Arg Trp Pro Gly Pro Asp Gly Val Tyr Pro Asp Gly Pro Gly 
195 200 205 

Val Glu Val Arg Asn Gin Thr Gin Leu Phe Gin Met lie Ala Leu Gly 
210 215 220 



151 



WO 01/16303 



PCT/EP00/08013 



Arg Thr Thr Val Val Met Pro Glu Ser Ser Arg Val Asn Leu Leu Glu 
225 230 235 240 

Gly Leu Ala Ala Val Pro Val Leu Asp Ala Pro Asp Val Thr Thr Val 
245 250 255 

lie Ala Trp Pro Pro His Ser Arg Ser Arg Ala Leu Ala Gly Leu Val 
260 265 270 

Arg Val Ala Thr Leu Leu 
275 



<210> 41 
<211> 7788 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (7785) 

<223> ORF18; Polyketidsynthase 

<400> 41 

atg age gaa gcc. ggg aac ctg ata gec gtc ate gga ctg tec tgc cgc 4 8 

Met Ser Glu Ala Gly Asn Leu lie Ala Val lie Gly Leu Ser Cys Arg 
15 10 15 

eta ccc cag gcg cct gac ccg get tec ttc tgg egg ttg ctg cgc acc 96 
Leu Pro Gin Ala Pro Asp Pro Ala Ser Phe Trp Arg Leu Leu Arg Thr 
20 25 30 

gga acg gac gcc ate acc acg gtc ccg gaa ggg egg tgg ggc gac ccg 144 • 
Gly Thr Asp Ala lie Thr Thr Val Pro Glu Gly Arg Trp Gly Asp Pro 
35 40 45 

ttg cct ggt egg gat gcg ccc aag ggc. ccg gaa tgg ggt ggt ttc ctg 192 
Leu Pro Gly Arg Asp Ala Pro Lys Gly Pro Glu Trp Gly Gly Phe Leu 
50 55 60 

get gat gtc gac tgc ttc gat ccc gag ttc ttc ggg ate teg ccg cga 240 
Ala Asp Val Asp Cys Phe Asp Pro Glu Phe Phe Gly He Ser Pro Arg 
65 70 75 80 

gaa gcg,gca gcc gtg gat ccc cag cag agg ctg get ctg gag etc gcc 288 
Glu Ala Ala Ala Val Asp Pro Gin Gin Arg Leu Ala Leu Glu Leu Ala 
85 90 95 
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tgg gag gca etc gaa gac gec ggt ate ccc gee ggc gag ctg cgc ggt 336 
Trp Glu Ala Leu Glu Asp Ala Gly lie Pro Ala Gly Glu Leu Arg Gly 
100 105 110 

act gee gee ggt gtg ttc atg ggg gcg ate tct gac gac tac gec gec 384 
Thr Ala Ala Gly Val Phe Met Gly Ala He Ser Asp Asp Tyr Ala Ala 
115 120 125. 

ctg ctg cgc gag age ccg ccg gaa gtg get gcg cag tac cgc etc acc 432 
Leu Leu Arg Glu Ser Pro Pro Glu Val Ala Ala Gin Tyr Arg Leu Thr 
130 135 140 

ggc acc cat cga age ctg ate gec aac cgc gtg tec tat gtg etc ggc 480 
Gly Thr His Arg Ser Leu He Ala Asn Arg Val Ser Tyr Val. Leu Gly 
145 150 155 160 

ctg cgc ggg cca age ctg acg gtg gat tea ggt cag tec teg tec ctg 528 
Leu Arg Gly Pro Ser Leu Thr Val Asp Ser Gly Gin Ser Ser Ser Leu 
165 170 175 

gtc ggc gtg cat etc gee age gag age ctg cga egg ggt gag tgc acg 576 
Val Gly Val His Leu Ala Ser Glu Ser Leu Arg Arg Gly Glu Cys Thr 
180 185 190 

ate gca etc gee ggc ggc gtg aac etc aac ctg gee gee gag age aac 624 
He Ala Leu Ala Gly Gly Val Asn Leu Asn Leu Ala Ala Glu Ser Asn 
195 200 205 

age get ctg atg gac ttc ggc gcg etc tec ccg gac ggt cgc tgc ttc 672 
Ser Ala Leu Met Asp Phe Gly Ala Leu Ser Pro Asp Gly Arg Cys Phe 
210 215 220 

acc ttc gat gtg egg gcg aac ggt tac gtc cgt ggt gag ggc ggc ggc 720 
Thr Phe Asp Val Arg Ala Asn Gly Tyr Val Arg Gly Glu Gly Gly Gly 
225 230 235 240 

ctt gtc gtg ctg aag aag gee gat cag gcg cac gee gat ggc gac egg 7 68 
Leu Val Val Leu Lys Lys Ala Asp Gin Ala His Ala Asp Gly Asp Arg 
245 250 255 

ate tac tgc etc ate cgc ggc age gcg gtc aac aac gat ggg ggc ggt 816 
He Tyr Cys Leu He Arg Gly Ser Ala Val Asn Asn Asp Gly Gly Gly 
260 265 270 

gee ggg etc acc gtt ccg gcg gcg gac gee cag gcg gag ctg ctg cgc 864 
Ala Gly Leu Thr Val Pro Ala Ala Asp Ala Gin Ala Glu Leu Leu Arg 
275 280 285 
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cag gca tac egg aac gcg ggc gtc gac ccg gec gec gtg cag tat gtc 912 
Gin Ala Tyr Arg Asn Ala Gly Val Asp Pro Ala Ala Val Gin Tyr Val 
290 295 300 

gag etc cac ggc age gcg acc agg gtc ggg gat ccc gtc gaa gca gca 960 
Glu Leu His Gly Ser Ala Thr Arg Val Gly Asp Pro Val Glu Ala Ala 
305 310 315 320 

gec etc gga get gtc ctg ggg gcg gcg aga egg ccc ggc gac gag ctg 
Ala Leu Gly Ala Val Leu Gly Ala Ala Arg Arg Pro Gly Asp Glu Leu 
325 330 335 

cgt gtg ggg teg gcg aag acc aac gtc ggc cat ctg gaa gca gcg gcg 1056 
Arg Val Gly Ser Ala Lys Thr Asn Val Gly His Leu Glu Ala Ala Ala 
340 ■ 345 350 



1008 



ggc gtc acc. ggg ttg ctg aag acc gca etc age ate tgg cac cgc gaa 1104 
Gly Val Thr Gly Leu Leu Lys Thr Ala Leu Ser He Trp His Arg Glu 
355 360 365 

ctg ccg ccg agt ctt cat ttc acc gec ccc aac ccg gaa ate ccg ctg 1152 
Leu Pro Pro Ser Leu His Phe Thr Ala Pro Asn Pro Glu He Pro Leu 
370 375 380 

gac gaa ttg aac eta cgc gtc cag cgt gat ctg egg ccg tgg ccg gag 1200 
Asp Glu Leu Asn Leu Arg Val Gin Arg Asp Leu Arg Pro Trp Pro Glu 
385 390 395 400 

age gag ggg ccg ctg ctg gee ggc gtc age gec ttc gga atg gga ggc 124 8 
Ser Glu Gly Pro Leu Leu Ala Gly Val Ser Ala Phe Gly Met Gly Gly 
405 410 415 

acg aac tgc cac ctg gtg etc tec ggc acg tec egg gtg gag cga egg 1296 
Thr Asn Cys His Leu Val Leu Ser Gly Thr Ser Arg Val Glu Arg Arg 
420 425 430 

cgc agt gga ccc get gag gcg acc atg ccg tgg gtc ttg teg gec aga 1344 
Arg Ser Gly Pro Ala Glu Ala Thr Met Pro Trp Val Leu Ser Al'a Arg 
435 440 445 

aca ccg gtc gca ttg cgt gcg cag gcg gcg cgc ttg cac acg cac etc 1392 
Thr Pro Val Ala Leu Arg Ala Gin Ala Ala Arg Leu His Thr His Leu 
450 455 460 

aat acg gee ggt caa agt ccg ttg gac gtc gee tac tea ctg gcg acc 144 0 
Asn Thr Ala Gly Gin Ser Pro Leu Asp Val Ala Tyr Ser Leu Ala Thr 
465 470 475 480 
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act cga tec gcg ctg ccg cac egg gec gcg ctg gtc gcg gac gac gaa 1488 
Thr Arg Ser Ala Leu Pro His Arg Ala Ala Leu Val Ala Asp Asp Glu 
485 490 495 

ccg aaa ctg etc gec ggg ttg aag gec etc get gac ggc gac gac gcg 153 6 
Pro Lys Leu Leu Ala Gly Leu Lys Ala Leu Ala Asp Gly Asp Asp Ala 
500 505 510 

ccc acg ctg tgc cac ggc gcg act tec ggc gag egg gca gcg gtc ttc 1584 
Pro Thr Leu Cys His Gly Ala Thr Ser Gly Glu Arg Ala Ala Val Phe 
515 520 525 

gtc ttt ccc gga cag ggc age cag tgg ate ggg atg ggt agg cag ctg 1632 
Val Phe Pro Gly Gin Gly Ser Gin Trp lie Gly Met Gly Arg Gin Leu 
530 535 . 540 

etc gaa acc tec gag gtt ttc gcg gcg teg atg teg gac tgc gee gac 1680 
Leu Glu Thr Ser Glu Val Phe Ala Ala Ser Met Ser Asp Cys Ala Asp 
545 550 555 560 

gca ttg gcg ccg cac ctg gat tgg tec ctg ctg gat gtg ctg cgc aac 1728 
Ala Leu Ala Pro His Leu Asp Trp Ser Leu Leu Asp Val Leu Arg Asn 
565 570 575 

gcg gee ggc get gcg cac ctt gac cac gac gat gtc gtc cag ccc gcg 1776 
Ala Ala Gly Ala Ala His Leu Asp His Asp Asp Val Val Gin Pro Ala 
580 585 590 

ctg ttc gee ate atg gtc teg etc gcg gag etc tgg cgt teg tgg ggc 1824 
Leu Phe Ala lie Met Val Ser Leu Ala Glu Leu Trp Arg Ser Trp Gly 
595 600 605 

gtg cgt ccg gtg gcg gtc gtc ggg cac teg cag ggg gag ate gcg gcg 1872 
Val Arg Pro Val Ala Val Val Gly His Ser Gin Gly Glu lie Ala Ala 
610 615 620 

gee tgc gtc gee ggg gee ctg tec gtc cgc gat gec gec agg gtg gtg 192 0 
Ala Cys Val Ala Gly Ala Leu Ser Val Arg Asp Ala Ala Arg Val Val 
625 630 635 640 

gcg gtg cgc age agg ctt ctg acg gcg ctg gee ggc agt ggc gcg atg 1968 
Ala Val Arg Ser Arg Leu Leu Thr Ala Leu Ala Gly Ser Gly Ala Met 
645 650 655 

gee teg ttg cag cat ccc gee gaa gag gtg egg caa ate ctg ttg ccc 2 016 
Ala Ser Leu Gin His Pro Ala Glu Glu Val Arg Gin lie Leu Leu Pro 
660 665 670 
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tgg cgc gat egg ate ggc gtg gcg ggg gtg aac gga ccg teg teg ace 2064 
Trp Arg Asp Arg He Gly Val Ala Gly Val Asn Gly Pro Ser Ser Thr 
675 680 685 



ctg gtg tea ggg gac egg gag gcg atg gcg gaa ctg ctg gee gag tgc 2112 
Leu Val Ser Gly Asp Arg Glu Ala Met Ala Glu Leu Leu Ala Glu Cys 
690 695 700 

gca gac cga gag etc egg atg cgc egg att ccc gtt gaa tac gec tec 2160 
Ala Asp Arg Glu Leu Arg Met Arg Arg He Pro val Glu Tyr Ala Ser 
705 710 715 720 

cat teg cct cac ate gag gtt gtc egg gat gag ctg ctg ggg ctg ttg 2208 
His Ser Pro His He Glu Val Val Arg Asp Glu Leu Leu Gly Leu Leu 
725 730 735 



gcg ccg gtc gaa ccc agg acg gga age ate ccg ate tat teg acg acg 2256 
Ala Pro Val Glu- Pro Arg Thr Gly Ser He Pro He Tyr Ser Thr Thr 
740 745 750 

ace ggg gac ctg ctg gac egg ccg atg gac gee gac tac tgg tac cgc 2304 
Thr Gly Asp Leu Leu Asp Arg Pro Met Asp Ala Asp Tyr Trp Tyr Arg 
755 760 765 



aac ctt cgt caa ccg gtg ctg ttc gaa gcg gec gtc gag gec ctg ttg 2352 
Asn Leu Arg Gin Pro Val Leu Phe Glu Ala Ala Val Glu Ala Leu Leu 
770 775 780 

aag egg ggg tac gac gca ttc ate gag ate age cca cac ccg gtg ctg 2400 
Lys Arg Gly Tyr Asp Ala Phe He Glu He Ser Pro His Pro Val Leu 
785 790 795 800 



act gcg aac ate cag gaa acc gee gtg cga gca ggg egg gag gta gtg 2448 
Thr Ala Asn He Gin Glu Thr Ala Val Arg Ala Gly Arg Glu Val Val 
805 810 815 



gcg etc ggg aca etc cgc cgc ggc gaa ggt ggc atg egg cag gcg ctg 2496 
Ala Leu Gly Thr Leu Arg Arg Gly Glu Gly Gly Met Arg Gin Ala Leu 
820 825 830 



acg teg ctg gee aga gca cac gta cac gga gtg gee gcg gac tgg cac 2544 

Thr Ser Leu Ala Arg Ala His Val His Gly Val Ala Ala Asp Trp His 
835 840 845 

gcg gtc ttc gec ggt acc ggg gcg cag egg gtc gac ctg ccg acg tac 2592 

Ala Val Phe Ala Gly Thr Gly Ala Gin Arg Val Asp Leu Pro Thr Tyr 
850 855 860 
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gcc ttt cag cga cag cgc tac tgg ctg gac gcg aag ctt ccc gac gtc 2640 
Ala Phe Gin Arg Gin Arg Tyr Trp Leu Asp Ala Lys Leu Pro Asp Val 
865 870 875 880 

gcc atg ccc gag age gac gtg teg acg gcg ttg egg gaa aag ctg egg 268 8 
Ala Met Pro Glu Ser Asp Val Ser Thr Ala Leu Arg Glu Lys Leu Arg 
885 890 895 

tct teg ccg agg gcg gac gtg gac teg acg acc etc acg atg ate egg 2736 
Ser Ser Pro Arg Ala Asp Val Asp Ser Thr Thr Leu Thr Met lie Arg 
900 • 905 910 

gca cag gca gcc gtg gtc etc ggc cac tec gat ccg aaa gag gtg gac 2784 
Ala Gin Ala Ala Val Val Leu Gly His Ser Asp Pro Lys Glu Val Asp 
915 920 925 

ccg gat egg acg ttc aag gac ctg ggc ttc gat tec teg "atg gtg gtc 2832 
Pro Asp Arg Thr Phe Lys Asp Leu Gly Phe Asp Ser Ser Met Val Val 
930 935 940 

gag ctg tgc gac cgc eta aac gcc gcc aca ggt ctg cga etc gca ccg 2880 
Glu Leu Cys Asp Arg Leu Asn Ala Ala Thr Gly Leu Arg Leu Ala Pro 
945 950 955 960 

age gtc gtt ttc gac tgt cct acg ccg gac aag etc gcc cgc cag gta 2928 
Ser Val Val Phe Asp Cys Pro Thr Pro Asp Lys Leu Ala Arg Gin Val 
965 970 975 

egg acg ttg ttg ttg ggc gag ccg get ccc atg acg tea cac egg ccg 2 976 
Arg Thr Leu Leu Leu Gly Glu Pro Ala Pro Met Thr Ser His Arg Pro 
980 985 990 

gac tec gat gcg gac gag cct ate gcc gtg ate ggg atg ggc tgt egg 3 024 
Asp Ser Asp Ala Asp Glu Pro lie Ala Val lie Gly Met Gly Cys Arg 
995 1000 1005 

ttt ccg ggt ggg gtg tec teg ccc gag gag ttg tgg cag ttg gtc gcc 3072 
Phe Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Gin Leu Val Ala 
1010 1015 1020 

get ggg egg gac gtc gtg tec gag ttc ccg get gac cga ggt tgg gac 3120 
Ala Gly Arg Asp Val Val Ser Glu Phe Pro Ala Asp Arg Gly Trp Asp 
1025 1030 1035 1040 

ctg gag cgt gcg ggg aca teg cac gtg cgc gcc ggc ggg ttc ttg cat 3168 
Leu Glu Arg Ala Gly Thr Ser His Val Arg Ala Gly Gly Phe Leu His 
1045 1050 1055 
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ggc gcc ccg gat ttt gac ccc ggg ttc ttc egg att teg ccg cgc gag 3216 
Gly Ala Pro Asp Phe Asp Pro Gly Phe Phe Arg He Ser Pro Arg Glu 
1060 1065 1070 

gcg ttg gcg atg gat cca cag cag egg ttg ctg ctg gaa ate gcc tgg 3264 
Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu He Ala Trp 
1075 1080 1085 

gaa gca gtc gaa cga ggc ggg ate aac ccg cag cac ctg cac gga agt 3312 
Glu Ala Val Glu Arg Gly Gly He Asn Pro Gin His Leu His Gly Ser 
1090 1095 1100 

caa acc ggg gtc ttc gtc ggc gcg ace tec ctg gac tac ggg cca cgc 3360 
Gin Thr Gly Val Phe Val Gly Ala Thr Ser Leu Asp Tyr Gly Pro Arg 
1105 1110 1115 1120 

ctg cac gaa gcg tec gag gag gcg gcc ggg tac gtg etc acc ggc age 3408 
Leu His Glu Ala Ser Glu Glu Ala Ala Gly Tyr. Val Leu Thr Gly Ser 
1125 1130 1135 

acc acg agt gtg gcg teg ggt egg gtt gcg tat teg ttc ggg ttc gag 3456 
Thr Thr Ser Val Ala Ser Gly Arg Val Ala Tyr Ser Phe Gly Phe Glu 
1140 1145 1150 

ggc cct gcg gtg acg gtg gat acg gcg tgt teg teg teg ttg gtg gcc 3504 
Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala 
1155 1160 1165 

ctg cat ttg gcg tgt cag teg ttg cgt teg ggt gag tgt gat ctg gcg 3552 
Leu His Leu Ala Cys Gin Ser Leu Arg Ser Gly Glu Cys Asp Leu Ala 
1170 1175 1180 

ttg gcc ggt ggt gtg acc gtg atg gcc acg ccg ggg atg ttc gtg gag 3600 
Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Gly Met Phe Val Glu 
1185 1190 1195 1200 

ttt teg egg cag cgt ggt ttg gcg ccg gat ggg egg tgc aag teg ttc 3648 
Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Sex Phe 
1205 1210 1215 

gcg gag gcc gcc gac ggc acc ggc tgg tec gag ggt get ggc ctg gtt 3696 
Ala Glu Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly Ala Gly Leu Val 
1220 1225 1230 

eta ctg gag egg ttg teg gat gcc egg egg aat ggg cat gag gtg ctg 3744 
Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Glu Val Leu 
1235 1240 1245 
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gcg gtt gtt cgt ggt agt gcg gtg aat cag gac ggt gcg teg aat ggt 3792 
Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
1250 1255 1260 

ttg acc gcg ccg aat ggt teg teg cag cag egg gtg att gee cag gca 3840 
Leu Thr Ala Pro Asn Gly Ser Ser Gin Gin Arg Val lie Ala Gin Ala 
1265 1270 1275 1280 

ttg gcg agt gcg ggg ttg teg gtg tec gat gtg gat get gtg gag gcg 3888 
Leu Ala Ser Ala Gly Leu Ser Val Ser Asp Val Asp Ala Val Glu Ala 
1285 1290 1295 

cat ggg acg ggc acg egg ctt ggt gat ccg ate gag gcg cag gcg ctg 3936 
His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Leu 
1300 1305 1310 

ate gec acc tac ggc cag ggc egg ctt ccg gaa egg cca ttg tgg ttg 3984 
lie Ala Thr Tyr Gly Gin Gly Arg Leu Pro Glu Arg Pro Leu Trp Leu 
1315 1320 1325 

ggc teg atg aag teg aac ate ggt cac gcg cag gca get gcg ggg ata 4 032 
Gly Ser Met Lys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly He 
1330 1335 1340 

gec ggc gtc atg aag atg gtg atg gcg atg egg cac ggg cag eta ccg 4080 
Ala Gly Val Met Lys Met Val Met Ala Met Arg His Gly Gin Leu Pro 
1345 1350 1355 1360 

cgc acg ttg cac gtg gat gag ccg act tct ggg gtg gat tgg teg gcg 4128 
Arg Thr Leu His Val Asp Glu Pro Thr Ser Gly Val Asp Trp Ser Ala 
1365 1370 1375 

ggg acg gtt caa etc ctt acg gag aac acg ccc tgg ccc ggg agt ggt 4176 
Gly Thr Val Gin Leu Leu Thr Glu Asn Thr Pro Trp Pro Gly Ser Gly 
1380 1385 1390 

cgt gtt cgt egg gtg ggg gtg teg teg ttc ggg ate agt ggt act aac 4224 
Arg Val Arg Arg Val Gly Val Ser Ser Phe Gly He Ser Gly Thr Asn 
1395 1400 1405 

gcg cac gtc ate etc gaa cag ccc ccg gga gtg ccg agt cag tct gcg 4272 
Ala His Val He Leu Glu Gin Pro Pro Gly Val Pro Ser Gin Ser Ala 
1410 1415 1420 

ggg ccg ggt teg ggc tct gtc gtg gat gtt ccg gtg gtg ccg tgg atg 4320 
Gly Pro Gly Ser Gly Ser Val Val Asp Val Pro Val Val Pro Trp Met 
1425 1430 1435 1440 
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gtg teg ggc aaa aca ccc gaa gcg eta tec gcg cag gca acg gcg ttg 4368 
Val Ser Gly Lys Thr Pro Glu Ala Leu Ser Ala Gin Ala Thr Ala Leu 
1445 1450 1455 

atg ace tat ctg gac gag cga cct gat gtc tec teg ctg gat gtt ggg 4416 
Met Thr Tyr Leu Asp Glu Arg Pro Asp Val Ser Ser Leu Asp Val Gly 
1460 1465 1470 

tac teg ctg gcg ttg aca egg teg gcg ctg gat gag cga gcg gtg gtg 4464 
Tyr Ser Leu Ala Leu Thr Arg Ser Ala Leu Asp Glu Arg Ala Val Val 
1475 1480 1485 

ctg ggg teg gac cgt gaa acg ttg ttg tgc ggt gtg aaa gcg ctg tct 4512 
Leu Gly Ser Asp Arg Glu Thr Leu Leu Cys Gly Val Lys Ala Leu Ser 
1490 1495 1500 

gec ggt cat gag get tct ggg ttg gtg ace gga tct gtg ggg get ggg 4560 
Ala Gly His Glu Ala Ser Gly Leu Val Thr Gly Ser Val Gly Ala Gly 
1505 1510 1515 1520 

ggc cgc ate ggg ttt gtg ttt tec ggt cag ggt ggt cag tgg ctg ggg 4608 
. Gly Arg lie Gly Phe Val Phe Ser Gly Gin Gly Gly Gin Trp Leu Gly 
1525 1530 1535 

atg ggc egg ggg ctt tac egg get ttt ccg gtg ttc get get gee ttt 4656 
Met Gly Arg Gly Leu Tyr Arg Ala Phe Pro Val Phe Ala Ala Ala Phe 
1540 1545 1550 

gac gaa get tgt gee gag ctg gat gcg cat ctg ggc cag gaa ate ggg 4704 
Asp Glu Ala Cys Ala Glu Leu Asp Ala His Leu Gly Gin Glu He Gly 
1555 1560 1565 

gtt egg gag gtg gtg tec ggt teg gat gcg cag ttg ctg gat egg acg 4752 
Val Arg Glu Val Val Ser Gly Ser Asp Ala Gin Leu Leu Asp Arg Thr 
1570 1575 1580 

ttg tgg gcg cag teg ggt ttg ttc gcg ttg cag gtg ggc ttg ctg aag 4800 
Leu Trp Ala Gin Ser Gly Leu Phe Ala Leu Gin Val Gly Leu Leu Lys 
1585 1590 1595 1600 

ttg ctg gat teg tgg ggg gtt egg ccg agt gtg gtg ttg ggg cat. teg 4 84 8 
Leu Leu Asp Ser Trp Gly Val Arg Pro Ser Val Val Leu Gly His Ser 
1605 1610 1615 



gtg ggc gag ttg gcg gcg gcg ttc gcg gcg ggt gtg gtg teg ttg teg 4 896 
Val Gly Glu Leu Ala Ala Ala Phe Ala Ala Gly Val Val Ser Leu Ser 
1620 1625 1630 
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ggt gcg get egg ttg gtg gcg ggt cgt gee egg ttg atg cag gcg ttg 4 944 
Gly Ala Ala Arg Leu Val Ala Gly Arg. Ala Arg Leu Met Gin Ala Leu 
1635 1640 1645 

ccg tct ggc ggt ggg atg ctg gcg gtg cct get ggt gag gag ctg ttg 4 992 
Pro Ser Gly Gly Gly Met Leu Ala Val Pro Ala Gly Glu Glu Leu Leu 
1650 1655 1660 

tgg teg ttg ttg gee gat cag ggt gat cgt gtg ggg ate gee gcg gtc 504 0 
Trp Ser Leu Leu Ala Asp Gin Gly Asp Arg Val Gly lie Ala Ala Val 
1665 1670 1675 1680 

aac get gcg ggg teg gtg gtg etc tct ggt gat egg gat gtg etc gat 5088 
Asn Ala Ala Gly Ser Val Val Leu Ser Gly Asp Arg Asp Val Leu Asp 
1685 1690 1695 

gac ctt gee ggt egg ctg gac ggg caa ggg ate egg teg agg tgg ttg 513 6 
Asp Leu Ala Gly Arg Leu Asp Gly Gin Gly lie Arg Ser Arg Trp Leu 
1700 1705 1710 

egg gtg teg cat gcg ttt cat teg tat egg atg gat ccg atg ctg gcg 5184 
Arg Val Ser His Ala Phe His Ser Tyr Arg Met Asp Pro Met Leu Ala 
1715 1720 1725 

gag ttc gee gaa ttg gca cga acc gtg gat tac egg cgt tgt gaa gtg 5232 
Glu Phe Ala Glu Leu Ala Arg Thr Val Asp Tyr Arg Arg Cys Glu Val 
1730 1735 1740 

ccg ate gtg teg acc ttg acc gga gac etc gat gac get ggc agg atg 52 80 
Pro lie Val Ser Thr Leu Thr Gly Asp Leu Asp Asp Ala Gly Arg Met 
1745 1750 1755 1760 

age ggg ccc gac tac tgg gtg cgt cag gtg cga gag ccg gtc cgc ttc 532 8 
Ser Gly Pro Asp Tyr Trp Val Arg Gin Val Arg Glu Pro Val Arg Phe 
1765 1770 1775 

gee gac ggt gtc cag gcg ctg gtc gag cac gat gtg gee acc gtt gtc 5376 
Ala Asp Gly Val Gin Ala Leu Val Glu His Asp Val Ala Thr Val Val 
1780 1785 1790 

gag etc ggt ccg gac ggg gcg ttg teg gcg ctg ate cag gaa tgt gtc 5424 
Glu Leu Gly Pro Asp Gly Ala Leu Ser Ala Leu lie Gin Glu Cys Val 
1795 1800 1805 

gee gca tec gat cac gee ggg egg ctg age gcg gtc ccg gcg atg cgc 54 72 
Ala Ala Ser Asp His Ala Gly Arg Leu Ser Ala Val Pro Ala Met Arg 
1810 1815 1820 
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agg aac cag gac gag gcg cag aag gtg atg acg gcc ctg gca cac gtc 5520 
Arg Asn Gin Asp Glu Ala Gin Lys Val Met Thr Ala Leu Ala His Val 
1825 1830 1835 1840 

cac gta cgt ggt ggt gcg gtg gac tgg egg teg ttc ttc gcc ggt aca 5568 
His Val Arg Gly Gly Ala Val Asp Trp Arg Ser Phe Phe Ala Gly Thr 
1845 1850 1855 

agg gcg aag caa ate gag ctg ccc acc tac gcc ttc caa cga cag egg 5616 
Arg Ala Lys Gin lie Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin Arg 
1860 1865 1870 

tac tgg ctg aac gcg ctg cgt gaa tct tec gcc ggc gac atg ggc agg 5664 
Tyr Trp Leu Asn Ala Leu Arg Glu Ser Ser Ala Gly Asp Met Gly Arg 
1875 1880 1885 

cgt gtc gaa gcg aag ttc tgg ggc gcc gtc gag cac gaa gat gtg gaa 5712 
Arg Val Glu Ala Lys Phe Trp Gly Ala Val Glu His Glu Asp Val Glu 
1890 1895 1900 

teg ctt gca cgc gta ttg ggc att gtg gac gac ggc get get gtg gat 5760 
Ser Leu Ala Arg Val Leu Gly lie Val Asp Asp Gly Ala Ala Val Asp 
1905 1910 1915 1920 

tec ctg aga age gcc ctt ccg gtg ttg gcc ggt tgg cag cga acc cgc 5808 
Ser. Leu Arg Ser Ala Leu Pro Val Leu Ala Gly Trp Gin Arg Thr Arg 
1925 1930 1935 

acc acc gag tec att atg gat cag egg tgt tac cga att ggc tgg egg 5 856 
Thr Thr Glu Ser lie Met Asp Gin Arg Cys Tyr Arg lie Gly Trp Arg 
1940 1945 1950 

cag gta gcc gga etc ccg ccg atg gga act gtt ttc ggt acc tgg ctg 5904 
Gin Val Ala Gly Leu Pro Pro Met Gly Thr Val Phe Gly Thr Trp Leu 
1955 1960 1965 

gtc ttc gcg cct cat ggc tgg tec age gaa ccg gag gtg gtg gac tgc 5952 
Val Phe Ala Pro His Gly Trp Ser Ser Glu Pro Glu Val Val Asp Cys 
1970 1975 1980 

gtt acg gca ctg egg gca cgt ggt gcc teg gtg gtg ttg gtg gaa get 6000 
Val Thr Ala Leu Arg Ala Arg Gly Ala Ser Val Val Leu Val Glu Ala 
1985 1990 1995 2000 

gat ccc gac ccg acc tec ttc ggc gac egg gta cga acc ctg tgt teg 604 8 
Asp Pro Asp Pro Thr Ser Phe Gly Asp Arg Val Arg Thr Leu Cys Ser 
2005 2010 2015 
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ggc ctt ccg gat ctt gtt ggc gtg ttg tea atg ttg tgc ttg gaa gaa 6096 
Gly Leu Pro Asp Leu Val Gly Val Leu Ser Met Leu Cys Leu Glu Glu 
.2020 2025 2030 

teg gtc ctt ccg gga ttt tct gcg gtg tea egg ggt ttt gcg ttg acc 6144 
Ser Val Leu Pro Gly Phe Ser Ala Val Ser Arg Gly Phe Ala Leu Thr 
2035 2040 2045 

gtg gag ttg gtg egg gtt ttg egg gca get ggt gcg act gee egg ttg 6192 
Val Glu Leu Val Arg Val Leu Arg Ala Ala Gly Ala Thr Ala Arg Leu 
2050 2055 2060 

tgg ttg ctg acg tgt ggt ggc gtg teg gtg gga gat gta ccg gtt cgt 6240 
Trp Leu Leu Thr Cys Gly Gly Val Ser Val Gly Asp Val Pro Val Arg 
2065 2070 2075 2080 

cca gcg cag gee ctg gcg tgg ggg ttg ggg cgt gtt gtg ggg ttg gag 6288 
Pro Ala Gin Ala Leu Ala Trp Gly Leu Gly Arg Val Val Gly Leu Glu 
2085 2090 2095 

cat ccg gac tgg tgg ggc ggc ttg ate gat att ccg gtc ttg ttc gac 6336 
His Pro Asp Trp Trp Gly Gly Leu lie Asp He Pro Val Leu Phe Asp 
2100 2105 2110 

gaa gac get caa gag egg ttg teg att gtg ctg gca ggt etc gat gag 6384 
Glu Asp Ala Gin Glu Arg Leu Ser He Val Leu Ala Gly Leu Asp Glu 
2115 2120 2125 

gac gag gtc gcg ate cgt cct gac ggc atg ttc gcg cgt egg ttg gta 6432 
Asp Glu Val Ala He Arg Pro Asp Gly Met Phe Ala Arg Arg Leu Val 
2130 2135 2140 

cgc cac act. gtc tea get gat gtg aag aag gcg tgg cgc ccc agg gga 6480 
Arg His Thr Val Ser Ala Asp Val Lys Lys Ala Trp Arg Pro Arg Gly 
2145 2150 2155 2160 

teg gtg ctg gtg acg ggc ggc acg ggt ggt ttg ggg gcg cac gtt get 6528 
Ser Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Ala His Val Ala 
2165 2170 2175 

cgc tgg ctg gee gac gee gga gee gaa cat gtg gcg atg gtg agt cga 6576 
Arg Trp Leu Ala Asp Ala Gly Ala Glu His Val Ala Met Val Ser Arg 
2180 2185 2190 

cgc ggc gag cag gca ccg agt get gag aag ttg egg acg gaa ctg gag 6624 
Arg Gly Glu Gin Ala Pro Ser Ala Glu Lys Leu Arg Thr Glu Leu Glu 
2195 2200 2205 
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gat ctg ggt acc egg gtg teg ate gtg 
Asp Leu Gly Thr Arg Val Ser. lie Val 
2210 2215 

gag gcg etc gee gaa gtg ctg aaa gee 
Glu Ala Leu Ala Glu Val Leu Lys Ala 
2225 2230 



tea tgc gat gtg acc gat cgc 6672 
Ser Cys Asp Val Thr Asp Arg 
2220 

ctt ccg get gaa aac ccg ttg 6720 
Leu Pro Ala Glu Asn Pro Leu 
2235 2240 



acc gcg gta gtg cat gcg gca ggc gtg ate gag act ggt gat gcg gcg 6768 
Thr Ala Val Val His Ala Ala Gly Val He Glu Thr Gly Asp Ala Ala 
2245 2250 2255 

gca atg age ctg get gat ttc gat cac gtg ttg tec gca aag gtg gee 6816 
Ala Met Ser Leu Ala Asp Phe Asp His Val Leu Ser Ala Lys Val Ala 
2260 2265 2270 

ggt gee gcg. aat ctg gat gee ttg ttg gee gat gtg gaa ttg gac gcg 6864 
Gly Ala Ala Asn Leu Asp Ala Leu Leu Ala Asp Val Glu Leu Asp Ala 
2275 2280 2285 

ttc gtc ttg ttc tea teg gtg tea gga gtt tgg ggc get ggg gga cac 6912 
Phe Val Leu Phe Ser Ser Val Ser Gly Val Trp Gly Ala Gly Gly His 
2290 2295 2300 

ggg get tac gca gcg gcg aat gee tat ctg gat gcg etc gcg gaa cag 6960 
Gly Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Gin 
2305 2310 2315 2320 

cgt egg teg cga ggg ctg gtc gcg act gcg gtg gee tgg ggg ccg tgg 7008 
Arg Arg Ser Arg Gly Leu Val Ala Thr Ala Val Ala Trp Gly Pro Trp 
2325 2330 2335 

gee ggc gag ggc atg gee tec gga gaa aca gga gac cag ctg cgc cga 7056 
Ala Gly Glu Gly Met Ala Ser Gly Glu Thr Gly Asp Gin Leu Arg Arg 
2340 2345 2350 

tac ggc ctt tec cca atg get ccg cag cac gee ate gee gga ate egg 7104 
Tyr Gly Leu Ser Pro Met Ala Pro Gin His Ala He Ala Gly He Arg 
2355 2360 2365 

cag gee gtg gaa cag gac gaa att tec ctg gta gtg gee gat gtc gat 7152 
Gin Ala Val Glu Gin Asp Glu He Ser Leu Val Val Ala Asp Val Asp 
2370 2375 2380 

tgg gca cgt ttc age gcg gga ttg ctg gcg get agg ccg egg ccg ctg 7200 
Trp Ala Arg Phe Ser Ala Gly Leu Leu Ala Ala Arg Pro Arg Pro Leu 
2385 2390 2395' 2400 
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ctg aac gaa ctg gcc gag gtc aag gaa etc etc gtc gat gec cag ccc 724 8 
Leu Asn Glu Leu Ala Glu Val Lys Glu Leu Leu Val Asp Ala Gin Pro 
2405 2410 2415 

gag gcg gga gtc ctt gcc gac gcg teg ttg gaa tgg egg cag cga ttg 7296 
Glu Ala Gly Val Leu Ala Asp Ala Ser Leu Glu Trp Arg Gin Arg Leu 
2420 2425 2430 

tec gcg gca ccg agg ccg aca cag gaa cag ctg ate ctg gag ctg gta 7344 
Ser Ala Ala Pro Arg Pro Thr Gin Glu Gin Leu lie Leu Glu Leu Val 
2435 2440 2445 

cgc ggc gaa acc get ctg gtg ctg gga cac ccc ggg gca gcg gcc gtt 7392 
Arg Gly Glu Thr Ala Leu Val Leu Gly His Pro Gly Ala Ala Ala Val 
2450 2455 2460 

gca teg gaa cga gcc ttc aag gac age gga ttc gac teg cag gcc gcg 7440 
Ala Ser Glu Arg Ala Phe Lys Asp Ser Gly Phe Asp Ser Gin Ala Ala 
2465 2470 2475 2480 

gtc gaa etc cgc gtt egg etc aat cga get acc ggc etc cag ttg cca 7488 
Val Glu Leu Arg Val Arg Leu Asn Arg Ala Thr Gly Leu Gin Leu Pro 
2485 2490 2495 

teg aca att ate ttc age cat ccc acg cct gcg gaa ctg get gcg gag 7536 
Ser Thr He He Phe Ser His Pro Thr Pro Ala Glu Leu Ala Ala Glu 
2500 2505 2510 

ctg egg gcg agg ctt ctt ccc gag tec gca gga gca ggc att ccc gag 7584 ' 
Leu Arg Ala Arg Leu Leu Pro Glu Ser Ala Gly Ala Gly He Pro Glu 
2515 . 2520 2525 

gag gac gag gcg cga ate aga gcg gca ctg acg teg ate ccg ttc ccg 7632 
Glu Asp Glu Ala Arg He Arg Ala Ala Leu Thr Ser He Pro Phe Pro 
2530 2535 2540 

gcc ttg cgc gag gca ggc ttg gtg agt ccg ctg etc gca ctt gcc gga 7680 
Ala Leu Arg Glu Ala Gly Leu Val Ser Pro Leu Leu Ala Leu Ala Gly 
2545 2550 2555 2560 

cac ccg gtc gac tec ggt ate tec teg gac gat gcg gcc gcg acc teg 7728 
His Pro Val Asp Ser Gly He Ser Ser Asp Asp Ala Ala Ala Thr Ser 
2565 2570 2575 

ate gat gcg atg gat gta gcc ggc etc gtc gaa gca gcg ctg ggc gaa 7776 
He Asp Ala Met Asp Val Ala Gly Leu Val Glu Ala Ala Leu Gly Glu 
2580 2585 2590 
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cgc gag tec tga 7788 
Arg Glu Ser 
2595 



<210> 42 
<211> 2595 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 42 

Met Ser Glu Ala Gly Asn Leu He Ala Val He Gly Leu Ser Cys Arg 
15 10 15 

Leu Pro Gin Ala Pro Asp Pro Ala Ser Phe Trp Arg Leu Leu Arg Thr 
20 25 30 

Gly Thr Asp Ala He Thr Thr Val Pro Glu Gly Arg Trp Gly Asp Pro 
35 40 45 

Leu Pro Gly Arg Asp Ala Pro Lys Gly Pro Glu Trp Gly Gly Phe Leu 
50 55 60 

Ala Asp Val Asp Cys Phe Asp Pro Glu Phe Phe Gly He Ser Pro Arg 
65 70 75 80 

Glu Ala Ala Ala Val Asp Pro Gin Gin Arg Leu Ala Leu Glu Leu Ala 
85 90 95 

Trp Glu Ala Leu Glu Asp Ala Gly He Pro Ala Gly Glu Leu Arg Gly 
100 105 110 

Thr Ala Ala Gly Val Phe Met Gly Ala He Ser Asp Asp Tyr Ala Ala 
115 120 125 

Leu Leu Arg Glu Ser Pro Pro Glu Val Ala Ala Gin Tyr Arg Leu Thr 
130 135 140 

Gly Thr His Arg Ser Leu He Ala Asn Arg Val Ser Tyr Val Leu Gly 
145 150 155 160 

Leu Arg Gly Pro Ser Leu Thr Val Asp Ser Gly Gin Ser Ser Ser Leu 
165 170 175 

Val Gly Val His Leu Ala Ser Glu Ser Leu Arg Arg Gly Glu Cys Thr 
180 185 190 
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He Ala Leu Ala Gly Gly Val Asn Leu Asn Leu Ala Ala Glu Ser Asn 
195 200 205 

Ser Ala Leu Met Asp Phe Gly Ala Leu Ser Pro Asp Gly Arg* Cys Phe 
210 215 220 

Thr Phe Asp Val Arg Ala Asn Gly Tyr Val Arg Gly Glu Gly Gly Gly 
225 230 235 240 

Leu Val Val Leu Lys Lys Ala Asp Gin Ala His Ala Asp Gly Asp Arg 
245 250 255 

He Tyr Cys Leu He Arg Gly Ser Ala Val Asn Asn Asp Gly Gly Gly 
260 265 270 

Ala Gly Leu Thr Val Pro Ala Ala Asp Ala Gin Ala Glu Leu Leu Arg 
275 280 285 

Gin Ala Tyr Arg Asn Ala Gly Val Asp Pro Ala Ala Val Gin Tyr Val 
290 295 300 

Glu Leu His Gly Ser Ala Thr Arg Val Gly Asp Pro Val Glu Ala Ala 
305 310 315 320 

Ala Leu Gly Ala Val Leu Gly Ala Ala Arg Arg Pro Gly Asp Glu Leu 
325 330 335 

Arg Val Gly Ser Ala Lys Thr Asn Val Gly His Leu Glu Ala Ala Ala 
340 345 350 

Gly Val Thr Gly Leu Leu Lys Thr Ala Leu Ser He Trp His Arg Glu 
355 360 365 

Leu Pro Pro Ser Leu His Phe Thr Ala Pro Asn Pro Glu He Pro Leu 
370 375 380 

Asp Glu Leu Asn Leu Arg Val Gin Arg Asp Leu Arg Pro Trp Pro Glu 
385 390 395 400 

Ser Glu Gly Pro Leu Leu Ala Gly Val Ser Ala Phe Gly Met Gly Gly 
405 410 415 

Thr Asn Cys His Leu Val Leu Ser Gly Thr Ser Arg Val Glu Arg Arg 
420 425 430 

Arg Ser Gly Pro Ala Glu Ala Thr Met Pro Trp Val Leu Ser Ala Arg 
435 440 445 
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Thr Pro Val Ala Leu 
450 

Asn Thr Ala Gly Gin 
465 

Thr Arg Ser Ala Leu 
485 

Pro Lys Leu Leu Ala 
500 

Pro Thr Leu Cys His 
515 

Val Phe Pro Gly Gin 
530 

Leu Glu Thr Ser Glu 
545 

Ala Leu Ala Pro His 
565 

Ala Ala Gly Ala Ala 
580 

Leu Phe Ala He Met 
595 

Val Arg Pro Val Ala 
610 

Ala Cys Val Ala Gly 
625 

Ala Val Arg Ser Arg 
645 

Ala Ser Leu Gin His 
660 

Trp Arg Asp Arg He 
675 

Leu Val Ser Gly Asp 
690 



Arg Ala Gin Ala Ala Arg 
455 

Ser Pro Leu Asp Val Ala 
470 475 

Pro His Arg Ala Ala Leu 
490 

Gly Leu Lys Ala Leu Ala 
505 

Gly Ala -Thr Ser Gly Glu 
520 

Gly Ser Gin Trp He Gly 
535 

Val Phe Ala Ala Ser Met 
550 555 

Leu Asp Trp Ser Leu Leu 
570 

His Leu Asp His Asp Asp 
585 

Val Ser Leu Ala Glu Leu 
600 

Val Val Gly His Ser Gin 
615 

Ala Leu Ser Val Arg Asp 
630 635 

Leu Leu Thr Ala Leu Ala 
650 

Pro Ala Glu Glu Val Arg 
665 

Gly Val Ala Gly Val Asn 
680 

Arg Glu Ala Met Ala Glu 
695 



Leu His Thr His Leu 
460 

Tyr Ser Leu Ala Thr 
480 

Val Ala Asp Asp Glu 
495 

Asp Gly Asp Asp Ala 
510 

Arg Ala Ala Val Phe 
525 

Met Gly Arg Gin Leu 
54 0 

Ser Asp Cys Ala Asp 
. 560 

Asp Val Leu Arg Asn 
575 

Val Val Gin Pro Ala 
590 

Trp Arg Ser Trp Gly 
605 

Gly Glu He Ala Ala 
620 

Ala Ala Arg Val Val 
640 

Gly Ser Gly Ala Met 
655 

Gin He Leu Leu Pro 
670 

Gly Pro Ser Ser Thr 
685 

Leu Leu Ala Glu Cys 
700 
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Ala Asp Arg Glu Leu Arg Met Arg Arg lie Pro Val Glu Tyr Ala Ser 
705 710 715 720 

His Ser Pro His lie Glu Val Val Ar$ Asp Glu Leu Leu Gly Leu Leu 
725 730 735 

Ala Pro Val Glu Pro Arg Thr Gly Ser lie Pro lie Tyr Ser Thr Thr 
740 745 750 

Thr Gly Asp Leu Leu Asp Arg Pro Met Asp Ala Asp Tyr Trp Tyr Arg . 
755 760 765 

Asn Leu Arg Gin Pro Val Leu Phe Glu Ala Ala Val Glu Ala Leu Leu 
770 775 780 



Lys Arg Gly Tyr Asp Ala Phe lie Glu lie Ser Pro His Pro Val Leu 
785 790 795 800 

Thr Ala Asn. He Gin Glu Thr Ala Val Arg Ala Gly Arg Glu Val Val 
805 810 815 



Ala Leu Gly Thr Leu Arg Arg Gly Glu Gly Gly Met Arg Gin Ala Leu 
820 825 830 

Thr Ser Leu Ala Arg Ala His Val His Gly Val Ala Ala Asp Trp His 
835 840 845 

Ala Val Phe Ala Gly Thr Gly Ala Gin Arg Val Asp Leu Pro Thr Tyr 
850 855 860 

Ala Phe Gin Arg Gin Arg Tyr Trp Leu Asp Ala Lys Leu Pro Asp Val 
865 870 875 880 

Ala Met Pro Glu Ser Asp Val Ser Thr Ala Leu Arg Glu Lys Leu Arg 
885 890 895 

Ser Ser Pro Arg Ala Asp Val Asp Ser Thr Thr Leu Thr Met He Arg 
900 905 910 

Ala Gin Ala Ala Val Val Leu Gly His Ser Asp Pro Lys Glu Val Asp 
915 920 925 

Pro Asp Arg Thr Phe Lys Asp Leu Gly Phe Asp Ser Ser Met Val Val 
930 935 940 



Glu Leu Cys Asp Arg Leu Asn Ala Ala Thr Gly Leu Arg Leu Ala Pro 
945 950 955 960 
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Ser Val Val Phe Asp Cys Pro Thr Pro Asp Lys Leu Ala Arg Gin Val 
965 970 975 

Arg Thr Leu Leu Leu Gly Glu Pro Ala Pro Met Thr Ser His Arg Pro- 
980 985 990 

Asp Ser Asp Ala Asp Glu Pro He Ala Val He Gly Met Gly Cys Arg 
995 1000 1005 

Phe Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Gin Leu Val Ala 
1010 1015 1020 

Ala Gly Arg Asp Val Val Ser Glu Phe Pro Ala Asp Arg Gly Trp Asp 
025 1030 1035 1040 

Leu Glu Arg Ala Gly Thr Ser His. Val Arg Ala Gly Gly Phe Leu His 
1045 1050 1055 

Gly Ala Pro Asp Phe Asp Pro Gly Phe Phe Arg He Ser Pro Arg Glu 
1060 1065 1070 

Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu He Ala Trp 
1075 1080 1085 

Glu Ala Val Glu Arg Gly Gly He Asn Pro Gin His Leu His Gly Ser 
1090 1095 1100 

Gin Thr Gly Val Phe Val Gly Ala Thr Ser Leu Asp Tyr Gly Pro Arg 
105 1110 1115 1120 

Leu His Glu Ala Ser Glu Glu Ala Ala Gly Tyr Val Leu Thr Gly Ser 
1125 1130 1135 

Thr Thr Ser Val Ala Ser Gly Arg Val Ala Tyr Ser Phe Gly Phe Glu 
1140 1145 1150 

Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala 
1155 1160 1165 

Leu His Leu Ala Cys Gin Ser Leu Arg Ser Gly Glu Cys Asp Leu Ala 
1170 1175 1180 

Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Gly Met Phe Val Glu 
185 1190 1195 1200 

Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe 
1205 1210 1215 
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Ala Glu Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly Ala Gly Leu Val 
1220 1225 1230 

Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Glu Val- Leu '* 
1235 1240 1245 

Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
1250 1255 1260 

Leu Thr Ala Pro Asn Gly Ser Ser Gin Gin Arg Val lie Ala Gin Ala 
265 1270 1275 1280 

Leu Ala Ser Ala Gly Leu Ser Val Ser Asp Val Asp Ala Val Glu Ala 
1285 1290 1295 

His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Leu 
1300 1305 1310 

He Ala Thr Tyr Gly Gin Gly Arg Leu Pro Glu Arg Pro Leu Trp Leu 
1315 1320 1325 

Gly Ser Met Lys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly He 
1330 1335 1340 

Ala Gly Val Met Lys Met Val Met Ala Met Arg His Gly Gin Leu Pro 
345 1350 1355 1360 

Arg Thr Leu His Val Asp Glu Pro Thr Ser Gly Val Asp Trp Ser Ala 
1365 1370 1375 

Gly Thr Val Gin Leu Leu Thr Glu Asn Thr Pro Trp Pro Gly Ser Gly 
1380 1385 1390 

Arg Val Arg Arg Val Gly Val Ser Ser Phe Gly He Ser Gly Thr Asn 
1395 1400 1405 

Ala His Val He Leu Glu Gin Pro Pro Gly Val Pro Ser Gin Ser Ala 
1410 1415 1420 

Gly Pro Gly Ser Gly Ser Val Val Asp Val Pro Val Val Pro Trp Met 
425 1430 1435 1440 

Val Ser Gly Lys Thr Pro Glu Ala Leu Ser Ala Gin Ala Thr Ala Leu 
1445 1450 1455 

Met Thr Tyr Leu Asp Glu Arg Pro Asp Val Ser Ser Leu Asp Val Gly 
1460 1465 1470 
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1475 14B0 

Leu Gly Ser Asp Arg Glu Thr Leu Leu 
1490 1495 

Ala Gly His Glu Ala Sex Gly Leu Val 
505 1510 



PCT/EP00/08013 

Leu Asp Glu Arg Ala Val Val 
1485 

Cys Gly Val Lys Ala Leu Ser 
1500 

Thr Gly Ser Val Gly Ala Gly 
1515 1520 



Gly Arg lie Gly Phe Val Phe Ser Gly Gin Gly Gly Gin Trp Leu Gly 
1525 1530 1535 

Met Gly Arg Gly Leu Tyr Arg Ala Phe Pro Val Phe Ala Ala Ala Phe 
1540 1545 1550 

Asp Glu Ala Cys Ala Glu Leu Asp Ala His Leu Gly Gin Glu lie Gly 
1555 1560 1565 

Val Arg Glu Val Val Ser Gly Ser Asp Ala Gin Leu Leu Asp Arg Thr 
1570 1575 1580 

Leu Trp Ala Gin Ser Gly Leu Phe Ala Leu Gin Val Gly Leu Leu Lys 
585 1590 1595 1600 

Leu Leu Asp Ser Trp Gly Val Arg Pro Ser Val Val Leu Gly His Ser 
1605 1610 1615 

Val Gly Glu Leu Ala Ala Ala Phe Ala Ala Gly Val Val Ser Leu Ser 
1620 1625 1630 

Gly Ala Ala Arg Leu Val Ala Gly Arg Ala Arg Leu Met Gin Ala Leu 
1635 1640 1645 

Pro Ser Gly Gly Gly Met Leu Ala Val Pro Ala Gly Glu Glu Leu Leu 
1650 1655 1660 



Trp Ser Leu Leu Ala Asp Gin Gly Asp Arg Val Gly lie Ala Ala Val 
665 1670 1675 1680 

Asn Ala Ala Gly Ser Val Val Leu Ser Gly Asp Arg Asp Val Leu Asp 
1685 1690 1695 

Asp Leu Ala Gly Arg Leu Asp Gly Gin Gly lie Arg Ser Arg Trp Leu 
1700 1705 1710 



Arg Val Ser His Ala Phe His Ser Tyr Arg Met Asp Pro Met Leu Ala 
1715 1720 1725 
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Glu Phe Ala Glu Leu Ala Arg Thr Val Asp Tyr Arg Arg Cys Glu Val 
1730 1735 1740 

Pro He Val Ser Thr Leu Thr Gly Asp Leu Asp Asp Ala Gly Arg Met 
745 1750 1755 1760 

Ser Gly Pro Asp Tyr Trp Val Arg Gin Val Arg Glu Pro Val Arg Phe 
1765 1770 1775 

Ala Asp Gly Val Gin Ala Leu Val Glu His Asp Val Ala Thr Val Val 
1780 1785 1790 

Glu Leu Gly Pro Asp Gly Ala Leu Ser Ala Leu He Gin Glu Cys Val 
1795 1800 1805 

Ala Ala Ser Asp His Ala Gly Arg Leu Ser Ala Val Pro Ala Met Arg 
1810 1815 1820 

Arg Asn Gin Asp Glu Ala Gin Lys Val Met Thr Ala Leu Ala His Val 
825 1830 1835 1840 

His Val Arg Gly Gly Ala Val Asp Trp Arg Ser Phe Phe Ala Gly Thr 
1845 1850 1855 

Arg Ala Lys Gin He Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin Arg 
1860 1865 1870 

Tyr Trp Leu Asn Ala Leu Arg Glu Ser Ser Ala Gly Asp Met Gly Arg 
1875 1880 1885 

Arg Val Glu Ala Lys Phe Trp Gly Ala Val Glu His Glu Asp Val Glu 
1890 1895 1900 

Ser Leu Ala Arg Val Leu Gly He Val Asp Asp Gly Ala Ala Val Asp 
905 1910 1915 1920 

Ser Leu Arg Ser Ala Leu Pro Val Leu Ala Gly Trp Gin Arg Thr Arg 
• 1925 1930 1935 

Thr Thr Glu Ser He Met Asp Gin Arg Cys Tyr Arg He Gly Trp Arg 
1940 1945 1950 

Gin Val Ala Gly Leu Pro Pro Met Gly Thr Val Phe Gly Thr Trp Leu 
1955 1960 1965 

Val Phe Ala Pro His Gly Trp Ser Ser Glu Pro Glu Val Val Asp Cys 
1970 1975 1980 
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Val Thr Ala Leu Arg Ala Arg Gly Ala Ser Val Val Leu Val Glu Ala 
985 1990 1995 2000 

Asp Pro Asp Pro Thr SerPhe Gly Asp Arg Val Arg Thr Leu Cys Ser 
2005 2010 2015 

Gly Leu Pro Asp Leu Val Gly Val Leu Ser Met Leu Cys Leu Glu Glu 
2020 2025 2030 

Ser Val Leu Pro Gly Phe Ser Ala Val Ser Arg Gly Phe Ala Leu Thr 
2035 2040 2045 

Val Glu Leu Val Arg Val Leu Arg Ala Ala Gly Ala Thr Ala Arg Leu 
2050 2055 2060 

Trp Leu Leu Thr Cys Gly Gly Val Ser Val Gly Asp Val Pro Val Arg 
065 2070 2075 2080 

Pro Ala Gin Ala Leu Ala Trp Gly Leu Gly Arg Val Val Gly Leu Glu 
2085 2090 2095 

His Pro Asp Trp Trp Gly Gly Leu lie Asp lie Pro Val Leu Phe Asp 
2100 2105 2110 

Glu Asp Ala Gin Glu Arg Leu Ser lie Val Leu Ala Gly Leu Asp Glu 
2115 2120 2125 

Asp Glu Val Ala lie Arg Pro Asp Gly Met Phe Ala Arg Arg Leu Val 
2130 2135 2140 

Arg His Thr Val Ser Ala Asp Val Lys Lys Ala Trp Arg Pro Arg Gly 
145 2150 2155 2160 

Ser Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Ala His Val Ala 
2165 2170 2175 

Arg Trp Leu Ala Asp Ala Gly Ala Glu His Val Ala Met Val Ser Arg 
2180 2185 2190 

Arg Gly Glu Gin Ala Pro Ser Ala Glu Lys Leu Arg Thr Glu Leu Glu 
2195 2200 2205 

Asp Leu Gly Thr Arg Val Ser lie Val Ser Cys Asp Val Thr Asp Arg 
2210 2215 2220 

Glu Ala Leu Ala Glu Val Leu Lys Ala Leu Pro Ala Glu Asn Pro Leu 
225 2230 2235 2240 
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Thr Ala Val Val His Ala Ala Gly Val lie Glu Thr Gly Asp Ala Ala 
2245' 2250 2255 

Ala Met Ser Leu Ala Asp Phe Asp His Val Leu Ser Ala Lys Val Ala 
2260 2265 2270 

Gly Ala Ala Asn Leu Asp Ala Leu Leu Ala Asp Val Glu Leu Asp Ala 
2275 2280 2285 

Phe Val Leu Phe Ser Ser Val Ser Gly Val Trp Gly Ala Gly Gly His 
2290 2295 2300 

Gly Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Gin 
305 2310 2315 2320 

Arg Arg Ser Arg Gly Leu Val Ala Thr Ala Val Ala Trp Gly Pro Trp 
2325 2330 2335 

Ala Gly Glu Gly Met Ala Ser Gly Glu Thr Gly Asp Gin Leu Arg Arg 
2340 2345 2350 . 

Tyr Gly Leu Ser Pro Met Ala Pro Gin His Ala lie Ala Gly lie Arg 
2355 2360 2365 

Gin Ala Val Glu Gin Asp Glu lie Ser Leu Val Val Ala Asp Val Asp 
2370 2375 2380 

Trp Ala Arg Phe Ser Ala Gly Leu Leu Ala Ala Arg Pro Arg Pro Leu 
385 2390 2395 2400 

Leu Asn Glu Leu Ala Glu Val Lys Glu Leu Leu Val Asp Ala Gin Pro 
2405 2410 2415 

Glu Ala Gly Val Leu Ala Asp Ala Ser Leu Glu Trp Arg Gin Arg Leu 
2420 2425 2430 

Ser Ala Ala Pro Arg Pro Thr Gin Glu Gin Leu lie Leu Glu Leu Val 
2435 2440 2445 

Arg Gly Glu Thr Ala Leu Val Leu Gly His Pro Gly Ala Ala Ala Val 
2450 2455 2460 

Ala Ser Glu Arg Ala Phe Lys Asp Ser Gly Phe Asp Ser Gin Ala Ala 
465 2470 2475 2480 

Val Glu Leu Arg Val Arg Leu Asn Arg Ala Thr Gly Leu Gin Leu Pro 
2485 2490 2495 
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• Ser Thr He lie Phe Ser His Pro Thr Pro Ala Glu Leu Ala Ala Glu 
2500 2505 2510 

Leu Arg Ala Arg Leu Leu Pro Glu Ser Ala Gly Ala Gly He Pro Glu 
2515 2520 2525 

Glu Asp Glu Ala Arg He Arg Ala Ala Leu Thr Ser He Pro Phe Pro 
2530 2535 2540 

Ala Leu Arg Glu Ala Gly Leu Val Ser Pro Leu Leu Ala Leu Ala Gly 
545 2550 2555 2560 

His Pro Val Asp Ser Gly He Ser Ser Asp Asp Ala Ala Ala Thr Ser 
2565 2570 2575 

He Asp Ala Met Asp- Val Ala Gly Leu Val Glu Ala Ala Leu Gly Glu 
2580 2585 2590 

Arg Glu Ser 
2595 



<210> 43 
<211> 6459 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1)..(6456) 

<223> ORF19; Polyketidsynthase 

<400> 43 

atg acg gtg acc acc agt tac gaa gaa gtt gtc gag gca ctg cga gca 4 8 

Met Thr Val Thr Thr Ser Tyr Glu Glu Val Val Glu Ala Leu Arg Ala 
1.5 10 15 

teg etc aag gag aac gaa cgc etc egg cgc ggc agg gat egg tfcc tec 96 
Ser Leu Lys Glu Asn Glu Arg Leu Arg Arg Gly Arg Asp Arg Phe Ser 
20 25 30 

gcg gag aag gac gat ccc ate gcg ate gtg gcg atg agt tgt cgt tat 144 
Ala Glu Lys Asp Asp Pro He Ala He Val Ala Met Ser Cys Arg Tyr 
35 40 45 

ccc ggt cag gtc tec teg ccg gag gac ctg tgg caa ctg get gee ggc 192 
Pro Gly Gin Val Ser Ser Pro Glu Asp Leu Trp Gin Leu Ala Ala Gly 
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50 55 60 

ggt gtg gac gcg ate tec gaa gtt ccg ggg gat cgc gga tgg gac ctg 24 0 
Gly Val Asp Ala lie Ser Glu Val Pro Gly Asp Arg Gly Trp Asp Leu 
65 70 75 80 

gat ggc gtg ttc gtt ccg gac tec gat cgt cct ggc acg teg tat gec 288 
Asp Gly Val Phe Val Pro Asp Ser Asp Arg Pro Gly Thr Ser Tyr Ala 
85 90 95 

tgc gcg ggc ggt ttt ctt cag ggc gtg teg gag. ttc gac gcg ggt ttc 336 
Cys Ala Gly Gly Phe Leu Gin Gly Val Ser Glu Phe Asp Ala Gly Phe 
100 105 110 

ttc ggg att teg ccg cgt gag gcg ctg gcg atg gat ccg cag cag egg 384 
Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
115 120 125 

ttg ctg ctg gaa gtc gcg tgg gag gtc ttc gag egg get ggg ctg gag 432 
Leu Leu Leu Glu Val Ala Trp Glu Val Phe Glu Arg Ala Gly Leu Glu 
130 135 140 

cag egg teg aca cgc ggt tec cgc gtt ggc gtg ttc gtc ggc ace aat 480 
Gin Arg Ser Thr Arg Gly Ser Arg Val Gly Val Phe Val Gly Thr Asn 
145 150 ' 155 160 

ggc cag gac tac gcg teg tgg ttg egg acg ccg ccg cct gcg gtg gca 528 
Gly Gin Asp Tyr Ala Ser Trp Leu Arg Thr Pro Pro Pro Ala Val Ala 
165 170 175 

ggt cat gtg ctg acg ggc ggt gcg gca gcg gtt ctt teg ggc egg gtt 576 
Gly His Val Leu Thr Gly Gly Ala Ala Ala Val Leu Ser Gly Arg Val 
180 185 190 

gcg tat teg ttc ggg ttc gag ggt cct gcg gtg acg gtg gat acg gcg 624 
Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala 
195 200 205 

tgt teg teg teg ttg gtg gcg ttg cac ctg gcg ggg caa gca ctg egg 672 
Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Gly Gin Ala Leu Arg 
210 215 220 

gee ggt gag tgc gac ctt gec ctt gec ggt ggc gtc acg gtg atg teg 720 
Ala Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ser 
225 230 235 240 

acg ccg aag gtg ttc ctg gag ttc tec cgc caa egg ggt etc gcg ccg 768 
Thr Pro Lys Val Phe Leu Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro 
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245 250 255 

gat ggg egg tgc aag teg ttc gcg gcg ggt gcg gat ggc act gga tgg 816 
Asp Gly Arg Cys Lys Ser Phe Ala Ala Gly Ala Asp Gly Thr Gly Trp 
260 265 270 

ggt gag ggt gee gga ctg ttg ttg ctg gag egg ttg teg gat gee egg 864 
Gly Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg 
275 280 285 

egg aat ggg cat gag gtg ctg gcg gtt gtt cgt ggt agt gcg gtg aat 912 
Arg Asn Gly His Glu Val Leu Ala Val Val Arg Gly Ser Ala Val Asn 
290 295 300 

cag gac ggt gcg teg aat ggt ttg acc gcg ccg aat ggt teg teg cag 960 
Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Ser Ser Gin 
305 310 315 320 

cag egg gtg att acc cag gcg ttg gcg agt gcg ggg ttg teg gtg tec 1008 
Gin Arg Val lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser Val Ser 
325 330 335 

gat gtg gat get gtg gag gcg cat ggg acg ggc acg egg ctt ggt gat 1056 
Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp 
340 345 350 

ccg ate gag gcg cag gcg ctg ate gec acc tac ggc cgt gat cgt gat 1104 
Pro He Glu Ala Gin Ala Leu He Ala Thr Tyr Gly Arg Asp Arg Asp 
355 360 365 

cct ggc egg ccg ttg tgg ttg ggg teg gtc aag teg aac ate ggt cat 1152 
Pro Gly Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn He Gly His 
370 375 380 

acg caa gcg gcg gcg ggt gtg get ggt gtg ate aag atg gtg atg gcg 1200 
Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Met Ala 
385 390 395 400 

atg egg cac ggg cag ctg cca cgc acg ttg cac gtg gaa teg ccg teg 124 8 
Met Arg His Gly Gin Leu Pro Arg Thr Leu His Val Glu Ser Pro Ser 
405 410 415 

ccg gag gtg gat tgg teg gcg ggg acg gtt caa etc ctt acg gag aac 1296 
Pro Glu Val Asp Trp Ser Ala Gly Thr Val Gin Leu Leu Thr Glu Asn 
420 425 430 

acg ccc tgg ccc agg agt ggt cgt gtt cgt egg gtg ggg gtg teg teg 1344 
Thr Pro Trp Pro Arg Ser Gly Arg Val Arg Arg Val Gly Val Ser Ser 
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435 440 

ttc ggg ate agt ggt act aac gcg cac 
Phe Gly lie Ser Gly Thr Asn Ala His 
450 455 

gga gtg ccg agt cag tct gcg ggg ccg 
Gly Val Pro Ser Gin Ser Ala Gly Pro 
465 470 

gtt ccg gtg gtg ccg tgg atg gtg teg 
Val Pro Val Val Pro Trp Met Val Ser 
485 



445 

gtc ate etc gaa cag ccc ccg 1392 
Val lie Leu Glu Gin Pro Pro 
460 

ggt teg ggt tct gtc gtg gat 1440 
Gly Ser Gly Ser Val Val Asp 
475 480 

ggc aaa aca ccc gaa gcg eta 1488 
Gly Lys Thr Pro Glu Ala Leu 
490 495 



tec gcg cag gca acg gcg ttg atg ace tat ctg gac gag cga cct gat 1536 
Ser Ala Gin Ala Thr Ala Leu Met Thr Tyr Leu Asp Glu Arg Pro Asp 
500 505 510 



gtc tec teg ctg gat gtt ggg tac teg ctg gcg ttg aca egg teg gcg 1584 
Val Ser Ser Leu Asp Val Gly Tyr Ser Leu Ala Leu Thr Arg Ser Ala 
515 520 525 

ctg gat gag cga gcg gtg gtg ctg ggg teg gac cgt gaa acg ttg ttg 1632 
Leu Asp Glu Arg Ala Val Val Leu Gly Ser Asp Arg Glu Thr Leu Leu 
530 535 540 

tgc ggt gtg aaa gcg ctg tct gee ggt cat gag get tct ggg ttg gtg 1680 
Cys Gly Val Lys Ala Leu Ser Ala Gly His Glu Ala Ser Gly Leu Val 
545 550 555 560 



ace gga tct gtg ggg get gg<g ggc cgc ate ggg ttt gtg ttt tec ggt 172 8 
Thr Gly Ser Val Gly Ala Gly Gly Arg He Gly Phe. Val Phe Ser Gly 
565 570 575 



cag ggt ggt cag tgg ctg ggg atg ggc egg ggg ctt tac egg get ttt 1776 
Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Gly Leu Tyr Arg Ala Phe 
580 585 590 



ccg gtg ttc get get gee 
Pro Val Phe Ala Ala Ala 
595 

cat ctg ggc cag gaa ate 
His Leu Gly Gin Glu He 
610 

gcg cag ttg ctg gat egg 
Ala Gin Leu Leu Asp Arg 



ttt gac gaa get tgt gee 
Phe Asp Glu Ala Cys Ala 
600 

ggg gtt egg gag gtg gtg 
Gly Val Arg Glu Val Val 
615 620 

acg ttg tgg gcg cag teg 
Thr Leu Trp Ala Gin Ser 



gag ctg ga?t gca 1824 

Glu Leu Asp Ala 

605 

tec ggt teg gat 1872 
Ser Gly Ser Asp 



ggt ttg ttc gcg 1920 
Gly Leu Phe Ala 
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625 630 635 640 

ttg cag gtg ggc ttg ctg aag ttg ctg gat teg tgg ggg'gtt egg ccg 1968 
Leu Gin Val Gly Leu Leu Lys Leu Leu Asp Ser Trp Gly Val Arg Pro 
645 650 655 

agt gtg gtg ttg ggg cat teg gtg ggc gag ttg gcg gcg gcg ttc gcg 2016 
Ser Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala Phe Ala 
660 665 670 

gcg ggt gtg gtg teg ttg teg ggt gcg get egg ttg gtg gcg ggt cgt 2064 
Ala Gly Val Val Ser Leu Ser Gly Ala Ala Arg Leu Val Ala Gly Arg 
675 680 685 

gee egg ttg atg cag gcg ttg ccg tct ggc ggt ggg atg ctg gcg gtg 2112 
Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Gly Met Leu Ala Val 
690 695 700 

cct get ggt gag gag ctg ttg tgg teg ttg ttg gee gat cag ggt gat 2160 
Pro Ala Gly Glu Glu Leu Leu Trp Ser Leu Leu Ala Asp Gin Gly Asp 
705 710 715 720 

cgt gtg ggg ate gee gcg gtc aac get gcg ggg teg gtg gtg etc tct 2208 
Arg Val Gly He Ala Ala Val Asn Ala Ala Gly Ser Val Val Leu Ser 
725 730 735 

ggt gat egg gat gtg etc gat gac ctt gee ggt egg ctg gac ggg caa 2256 
Gly Asp Arg Asp Val Leu Asp Asp Leu Ala Gly Arg Leu Asp Gly Gin 
740 745 750 

ggg ate egg teg agg tgg ttg egg gtg teg cat gcg ttt cat teg tat 2304 
Gly He Arg Ser Arg Trp Leu Arg Val Ser His Ala Phe His Ser Tyr 
755 760 765 

egg atg gat ccg atg ctg gcg gag ttc gee gaa tt§ gca cga acc gtg 2352 
Arg Met Asp Pro Met Leu Ala Glu Phe Ala Glu Leu Ala Arg Thr Val 
770 775 780 

gat tac egg cgt tgt gaa gtg ccg ate gtg teg acc ttg acc gga gac 2400 
Asp Tyr Arg Arg Cys Glu Val Pro He Val Ser Thr Leu Thr - Gly Asp 
785 790 795 800 

etc gat gac get ggc agg atg age ggg ccc gac tac tgg gtg cgt cag 2448 
Leu Asp Asp Ala Gly Arg Met Ser Gly Pro Asp Tyr Trp Val Arg Gin 
805 810 815 

gtg cga gag ccg gtc cgc ttc gee gac ggt gtc cag gcg ctg gtc gag 2496 
val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Gin Ala Leu Val Glu 
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820 825 830 

cac gat gtg gcc act gtt gtc gag etc ggt ccg gac ggg gcg ttg teg 2544 
His Asp Val Ala Thr Val Val Glu Leu Gly Pro Asp Gly Ala Leu Ser 
835 840 845 

gcg ctg ate cag gaa tgt gtc gcc gca tec gat cac gcc ggg egg ctg 2592 
Aia Leu lie Gin Glu Cys Val Ala Ala Ser Asp His Ala Gly Arg Leu 
850 855 860 

age gcg gtc ccg gcg atg cgc agg aac cag gac gag gcg cag aag gtg 2640 
Ser . Ala Val Pro Ala Met Arg Arg Asn Gin Asp Glu Ala Gin Lys Val 
865 870 875 880 

atg acg gcc ctg gca cac gtc cac gta cgt ggt ggt gcg gtg gac tgg 2688 
Met Thr Ala Leu Ala His Val His Val Arg Gly .Gly Ala Val Asp Trp 
885 890 895 

egg teg ttc ttc gcc ggt acg gga gcg aaa caa ate gag ctg ccc acc 2736 
Arg Ser Phe Phe Ala Gly Thr Gly Ala Lys Gin lie Glu Leu Pro Thr 
900 905 910 

tac gcc ttc caa cga cag egg tac tgg ctg gtg cca teg gat tec ggt 2784 
Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Val Pro Ser Asp Ser Gly 
915 920 925 

gat gtg aca ggt gcc ggt ctg gcc ggg gcg gag cat ccg ctg ttg ggt 2 832 
Asp Val Thr Gly Ala Gly Leu Ala Gly Ala Glu His Pro Leu Leu Gly 
930 935 940 

get gtg gtg ccg gtc gcg ggt ggt gac gag gtg ttg ctg acc ggc agg 2880 
Ala Val Val Pro Val Ala Gly Gly Asp Glu Val Leu Leu Thr Gly Arg 
945 950 955 960 

att teg gtg egg acg cat ccg tgg ctg gcc gaa cac egg gtg ctg ggt 2 928 
He Ser Val Arg Thr His Pro Trp Leu Ala Glu His Arg Val Leu Gly 
965 970 975 

gaa gtg ate gtt gcg ggc acc gcg ttg ctg gag ate gcc ttg cac gcg 2976 
Glu Val He Val Ala Gly Thr Ala Leu Leu Glu lie Ala Leu His Ala 
980 985 990 

ggg gaa cgt ctt ggt tgt gaa egg gtg gaa gag etc acc ctg gaa gca 3 024 
Gly Glu Arg Leu Gly Cys Glu Arg Val Glu Glu Leu Thr Leu Glu Ala 
995 1000 1005 

ccg ctg gtc ctg ccg gag cgc ggg gcg ate cag gtt cag ctg cga gtg 3 072 
Pro Leu Val Leu Pro Glu Arg Gly Ala He Gin Val Gin Leu Arg Val 
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1010 1015 1020 



ggc gcg ccc gag aat tec gga cgc agg ccg atg gcg ctg tat tea cgc 3120 
Gly Ala Pro Glu Asn Ser Gly Arg Arg Pro Met Ala Leu Tyr Ser Arg 
1025 1030 1035 1040 



ccc gaa ggg gcg gcg gag cat gac tgg acg egg cac gee acg ggc egg 3168 
Pro Glu Gly Ala Ala Glu His Asp Trp Thr Arg His Ala Thr Gly Arg 
1045 1050 1055 



ttg gcg cca ggc cgc ggc gag gcg get gga gac ctg gee gac tgg ccg 3216 
Leu Ala Pro Gly Arg Gly Glu Ala Ala Gly Asp Leu Ala Asp Trp Pro 
1060 1065 1070 



get cct ggc gcg ctg ccg gtc gac etc gac gaa ttc tat egg gac etc 3264 
Ala Pro Gly Ala Leu Pro Val Asp Leu Asp Glu Phe Tyr Arg Asp Leu 
1075 1080 1085 



gca gag ctt ggg ctg gag tac ggc ccg ate ttc caa ggg etc aag gcg 3312 
Ala Glu Leu Gly Leu Glu Tyr Gly Pro He Phe Gin Gly Leu Lys Ala 
1090 1095 1100 



gec tgg egg caa ggg gac gag gtg tac gee gaa gec gcg ctg ccg gga 3360 
Ala Trp Arg Gin Gly Asp Glu Val Tyr Ala Glu Ala Ala Leu Pro Gly 
1105 1110 1115 1120 

acg gaa gat tct ggt ttc ggg gtg cat ccg gca ctg ctg gac gcg get 3408 
Thr Glu Asp Ser Gly Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala 
1125 1130 1135 

ctg cac gca acg get gtc cga gac atg gat gac gca cgc ttg ccg ttc 3456 
Leu His Ala Thr Ala Val Arg Asp Met Asp Asp Ala Arg Leu Pro Phe 
1140 1145 1150 



cag tgg gaa ggt gtg tec ctg cac gee aag gee gcg ccg get ttg egg 3504 
Gin Trp Glu Gly Val Ser Leu His Ala Lys Ala Ala Pro Ala Leu Arg 
1155 1160 1165 

gtc cgc gtg gtc ccg get ggt gac gat gee aag tec ctg ctg gt't tgt 3552 
Val Arg Val Val Pro Ala Gly Asp Asp Ala Lys Ser Leu Leu Val Cys 
1170 1175 1180 

gat ggc acc ggt cga ccg gtg ate teg gtg gac cga etc gta ttg egg 3600 
Asp Gly Thr Gly Arg Pro Val He Ser Val Asp Arg Leu Val Leu Arg 
1185 1190 1195 1200 



teg get gcg gee egg egg acc ggt gcg cgc cga cag gee cat caa get 3648 
Ser Ala Ala Ala Arg Arg Thr Gly Ala Arg Arg Gin Ala His Gin Ala 
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1205 1210 1215 

egg ttg tac egg ttg age tgg cca acg gtt caa ctg ccg aca tec get 3696 

Arg Leu Tyr Arg Leu Ser Trp Pro Thr Val Gin Leu Pro Thr Ser Ala 
1220 1225 1230 

cag cca ccg tec tgc gtg ctt etc ggc acc tea gaa gtg tec get gac 3744 

Gin Pro Pro Ser Cys Val Leu Leu Gly Thr Ser Glu Val Ser Ala Asp 
1235 1240 1245 



ata cag gtg tat ccg gac etc egg teg ttg acg get gcg ttg gat gee 3792 
lie Gin Val Tyr Pro Asp Leu Arg Ser Leu Thr Ala Ala Leu Asp Ala 
1250 1255 1260 



ggt gee gaa cca ccc ggc gtc gtc ate gca ccc acg ccc ccc ggc ggt 3840 
Gly Ala Glu Pro Pro Gly Val Val lie Ala Pro Thr Pro Pro Gly Gly 
1265 1270 1275 1280 



gga cga aca gcg gat gtc egg gag acg act egg cat gca etc gac ctg 3888 
Gly Arg Thr Ala Asp Val Arg Glu Thr Thr Arg His Ala Leu Asp Leu 
1285 1290 1295 



gta caa ggc tgg ctt tec gat cag cga etc aac gaa tec cga ttg etc 3936 
Val Gin Gly Trp Leu Ser Asp Gin Arg Leu Asn Glu Ser Arg Leu Leu 
1300 1305 1310 



ctg gtg aca cag gga gca gtg gec gtg gag ccg ggc gaa ccc gtg acc 3984 
Leu Val Thr Gin Gly Ala Val Ala Val Glu Pro Gly Glu Pro Val Thr 
1315 1320 1325 

gat ctg gcg cag gec gcg etc tgg gga ctg ctg egg teg acg cag acc 4 032 
Asp Leu Ala Gin Ala Ala Leu Trp Gly Leu Leu Arg Ser Thr Gin Thr 
1330 1335 1340 



gaa cac cct gat cgc ttc gtc etc gtc gat gtg cct gag ccc gcg caa 4 080 
Glu His Pro Asp Arg Phe Val Leu Val Asp Val Pro Glu Pro Ala Gin 
1345 1350 1355 1360 



etc etc ccc gcg ctg ccg ggg gtg ctg gee tgc ggc gaa cct cag etc 4128 
Leu Leu Pro Ala Leu Pro Gly Val Leu Ala Cys Gly Glu Pro Gin Leu 
1365 1370 1375 



gcg ttg cga cgt ggc ggc get cat gcg ccc aga ctg get gga ctg ggc 4176 
Ala Leu Arg Arg Gly Gly Ala His Ala Pro Arg Leu Ala Gly Leu Gly 
1380 1385 1390 

age gat gac gtc ctg ccc gtg ccg gac ggc acc ggg tgg cga ttg gag 4224 
Ser Asp Asp Val Leu Pro Val Pro Asp Gly Thr Gly Trp Arg Leu Glu 
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1395 



1400 



1405 



gcc acg cgc ccg gga age ctg gat ggg ttg gca ttg gtg gac gaa ccg 
Ala Thr Arg Pro Gly Ser Leu Asp Gly Leu Ala Leu Val Asp Glu Pro 
1410 1415 1420 



4272 



acg gcc acg gca ccg ctg ggt gac ggt gag gtc agg att gcg atg cgc 
Thr Ala Thr Ala Pro Leu Gly Asp Gly Glu Val Arg He Ala Met Arg 
1425 1430 1435 1440 



4320 



gcg gcc ggg gtg aac ttc egg gat gcg etc ate gcg etc ggt atg tat 
Ala Ala Gly Val Asn Phe Arg Asp Ala Leu He Ala Leu Gly Met Tyr 
1445 1450 1455 



4368 



ccc ggt gtg gca teg ctg ggc agt gag ggc gcc ggg gtc gtg gtg gag 
Pro Gly Val Ala Ser Leu Gly Ser Glu Gly Ala Gly Val Val Val Glu 
1460 1465 1470 



4416 



ace ggc ccc ggc gtc ace ggc ctg gca ccc ggc gac cgc gtg atg gga 
Thr Gly Pro Gly Val Thr Gly Leu Ala Pro Gly Asp Arg Val Met Gly 
1475 1480 1485 



4464 



atg ate ccg aag gcg ttc ggg ccg etc gcg gtc gcc gac cat cgc atg 
Met He Pro Lys Ala Phe Gly Pro Leu Ala Val Ala Asp His Arg Met 
1490 1495 1500 



4512 



gtg acg agg att ccc get ggt tgg age ttc gcg egg gcc gca teg gtg 
Val thr Arg He Pro Ala Gly Trp Ser Phe Ala Arg Ala Ala Ser Val 
1505 1510 1515 1520 



4560 



ccg ate gtc ttt etc ace gcc tac tac gcg ctg gtt gat etc gcc ggg 
Pro lie Val Phe Leu Thr Ala Tyr Tyr Ala Leu Val Asp Leu Ala Gly 
1525 1530 1535 



4608 



ttg aga cca ggg gag teg ttg ctg gtt cat teg gcc gcc ggt ggg gtg 
Leu Arg Pro Gly Glu Ser Leu Leu Val His Ser Ala Ala Gly Gly Val 
1540 1545 1550 



4656 



ggg atg gcc gcg ate caa etc gcc agg cac etc ggt gca gag gtg tac 
Gly Met Ala Ala lie Gin Leu Ala Arg His Leu Gly Ala Glu Val Tyr 
1555 1560 1565 



4704 



gcc ace get age gag gac aag tgg caa gcc gtg gag ctg age cga gaa 
Ala Thr Ala Ser Glu Asp Lys Trp Gin Ala Val Glu Leu Ser Arg Glu 
1570 1575 1580 



4752 



cac etc get teg teg egg acg tgc gat ttc gag cag cag ttc etc ggg 
His Leu Ala Ser Ser Arg Thr Cys Asp Phe Glu Gin Gin Phe Leu Gly 



4800 
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1585 1590 1595 1600 



gca acc ggc gga cgc ggc gtc gac gtc gtg etc aac tec etc gee ggg 484 8 
Ala Thr Gly Gly Arg Gly Val Asp Val Val Leu Asn Ser Leu Ala Gly 
1605 1610 1615 



gag ttc gee gat gcg tct ctg cga atg ctg ccg cgc ggt ggc cgt ttc 4896 
Glu Phe Ala Asp Ala Ser Leu Arg Met Leu Pro Arg Gly Gly Arg Phe 
1620 1625 1630 



ctg gag ttg ggg aag acg gat gtt cgt gac ccc gtc gag gtc gee' gat 4944 
Leu Glu Leu Gly Lys Thr Asp Val Arg Asp Pro Val Glu Val Ala Asp 
1635 1640 1645 

gcg cat ccg ggc gtg tct tac cag get ttc gat acc gta gag gca ggc 4992 
Ala His Pro Gly Val Ser Tyr Gin Ala Phe Asp Thr Val Glu Ala Gly 
1650 1655 1660 

ccg cag cga ate ggc- gag atg ctt cac gag ctg gtg gag ttg ttc gag 504 0 
Pro Gin Arg lie Gly Glu Met Leu His Glu Leu Val Glu Leu Phe Glu 
1665 1670 1675 1680 



gga cgc gtg ctg gag ccc ctg cct gtc acg get tgg gac gtt egg cag 5088 
Gly Arg Val Leu Glu Pro Leu Pro Val Thr Ala Trp Asp Val Arg Gin 
1685 1690 1695 

gcg ccc gag gcg eta egg cac ctg age caa gcg egg cat gtg gga aag 5136 
Ala Pro Glu Ala Leu Arg His Leu Ser Gin Ala Arg His Val Gly Lys 
1700 1705 1710 

ctg gtg etc acc atg cct ccg gtg tgg gac gee gca ggc acg gtt ctg 5184 
Leu Val Leu Thr Met Pro Pro Val Trp Asp Ala Ala Gly Thr Val Leu 
1715 1720 1725 

gtt acc ggc gga acg gga gca ctt ggc gca gag gtc gee egg cac etc 5232 
Val Thr Gly Gly Thr Gly Ala Leu Gly Ala Glu Val Ala Arg His Leu 
1730 1735 1740 



gtg ate gag cgc ggg gtg cga aac ctg gtc etc gtc age agg cgc ggt 52 8 0 
Val lie Glu Arg Gly Val Arg Asn Leu Val Leu Val Ser Arg Arg Gly 
1745 1750 1755 1760 



ccc gca gee agt ggc get get gag etc gtg gcg caa ctg acg gee tac 5328 
Pro Ala Ala Ser Gly Ala Ala Glu Leu Val Ala Gin Leu Thr Ala Tyr 
1765 1770 1775 



ggt gee gag gtt tec ttg cag get tgc gat gtc gec gat cgt gag acc 5376 
Gly Ala Glu Val Ser Leu Gin Ala Cys Asp Val Ala Asp Arg Glu Thr 
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1780 1785 1790 

ttg gcg aag gtg ctt gcc age ate ccg gac gag cat ccg ttg ace gee 5424 
Leu Ala Lys Val Leu Ala Ser lie Pro Asp Glu His Pro Leu Thr Ala 
1795 1800 1805 



gtg gtg cac gcg get ggt gtt etc gac gac gga gtg tec gaa teg etc 54 72 
Val Val His Ala Ala Gly Val Leu Asp Asp Gly Val Ser Glu Ser Leu - 
1810 1815 1820 



ace gtg gag egg ctg gac cag gtt ctg cgc ccg aag gtc gat ggc gcg 5520 
Thr Val Glu Arg Leu Asp Gin Val Leu Arg Pro Lys Val Asp Gly Ala 
1825 1830 1835 1840 



egg aat ctg etc gag ctg ate gac ccg gac gtg gcc etc gtg ttg ttc 556B 

Arg Asn Leu Leu Glu Leu lie Asp Pro Asp Val Ala Leu Val Leu Phe 
1845 1850 1855 

teg teg gtg teg ggt gtg etc ggc age ggt ggg cag ggt aac tac gcg . 5616 

Ser Ser Val Ser Gly Val Leu Gly Ser Gly Gly Gin Gly Asn Tyr Ala 
1860 1865 1870 

gcg gcc aac tec ttc etc gac gca ttg gcg cag caa agg cag teg cgc 5664 

Ala Ala Asn Ser Phe Leu Asp Ala Leu Ala Gin Gin Arg Gin Ser Arg 
1875 1880 1885 



ggc eta ccg acg aga tea ttg gcc tgg ggg ccc tgg gcg gaa cat ggc 5712 
Gly Leu Pro Thr Arg Ser Leu Ala Trp Gly Pro Trp Ala Glu His Gly 
1890 1895 1900 



atg gcc age acc ttg cgc gaa gcc gag cag gat cga ttg gcg cga tct 5760 
Met Ala Ser Thr Leu Arg Glu Ala Glu Gin Asp Arg Leu Ala Arg Ser 
1905 1910 1915 1920 

ggg ttg ctg ccg ate teg acc gag gag ggg ttg tec cag ttc gac gcc 58 08 
Gly Leu Leu Pro He Ser Thr Glu Glu Gly Leu Ser Gin Phe Asp Ala 
1925 1930 1935 

gcg tgc ggc ggc gcg cat acc gtg gtg gcg ccg gtt cga ttc age cgc 5856 
Ala Cys Gly Gly Ala His Thr Val Val Ala Pro Val Arg Phe Ser Arg 
1940 1945 1950 



ttg tec gac ggg aac gcg ate aag ttc tec gtc ctg caa ggt ttg gtc 5904 
Leu Ser Asp Gly Asn Ala He Lys Phe Ser Val Leu Gin Gly Leu Val 
1955 1960 1965 

ggg ccg cat cgc gtc aac aaa gcg gcg act gcg gat gat gcc gag age 5952 
Gly Pro His Arg Val Asn Lys Ala Ala Thr Ala Asp Asp Ala Glu Ser 
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1970 1975 1980 

etc egg aaa egg ttg gga cgc ttg ccg gat gca gaa caa cat egg att 6000 
Leu Arg Lys Arg Leu Gly Arg Leu Pro Asp Ala Glu Gin His Arg lie 
1985 1990 1995 2000 

ctg ctg gac etc gtc cgc atg cat gtg gcg gca gtg etc gga ttc gee 6048 
Leu Leu Asp Leu Val Arg Met His Val Ala Ala Val Leu Gly Phe Ala 
2005 2010 2015 

ggt tct cag gag ate ace gcg gac ggc acg ttc aag gtg ctg ggc ttc 6096 
Gly Ser Gin Glu lie Thr Ala Asp Gly Thr Phe Lys Val Leu Gly Phe 
2020 2025 2030 

gac teg ttg acc gtg gtc gag ttg cgc aac egg ate aac ggg gcg acg 6144 
Asp Ser . Leu Thr Val Val Glu Leu Arg Asn Arg lie Asn Gly Ala Thr 
2035 2040 2045 



ggg ct g c 9 a ctg ccc gee acc ctg gtg ttc aac tac ccg acg ccg gat 6192 
Gly Leu Arg Leu Pro Ala Thr Leu Val Phe Asn Tyr Pro Thr Pro Asp 
2050 2055 2060 

gcg etc gec gcg cac etc gtc acc gcg ctg tec gca gac cgc ctg gec 6240 
Ala Leu Ala Ala His Leu Val Thr Ala Leu Ser Ala Asp Arg Leu Ala 
2065 2070 2075 2080 

ggg aca ttc gag gaa etc gac agg tgg gcg gcg aac ctg ccc acg ctg 6288 
Gly Thr Phe Glu Glu Leu Asp Arg Trp Ala Ala Asn Leu Pro Thr Leu 
2085 2090 2095 

gee agg gat gag gee acg egg gcg cag ate acc acc egg eta cag gcg 6336 
Ala Arg Asp Glu Ala Thr Arg Ala Gin lie Thr Thr Arg Leu Gin Ala 
2100 2105 2110 

ate ttg cag age ctg gcg gac gtg tec ggc gga acc ggc ggc ggc tec 6384 
He Leu Gin Ser Leu Ala Asp Val Ser Gly Gly Thr Gly Gly Gly Ser 
2115 2120 2125 

gtg ccg gac egg etc aga teg gec acg gac gac gag ctt ttc caa etc 6432 
Val Pro Asp Arg Leu Arg Ser Ala Thr Asp Asp Glu Leu Phe Gin Leu 
2130 2135 2140 

etc gac aac gat etc gaa ctt ccc tga 6459 
Leu Asp Asn Asp Leu Glu Leu Pro 
2145 2150 



<210> 44 
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<211> 2152 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 44 

Met Thr Val Thr Thr Ser Tyr Glu Glu Val Val Glu Ala Leu Arg Ala 
15 10 15 

Ser Leu Lys Glu Asn Glu Arg Leu Arg Arg Gly Arg Asp Arg Phe Ser 
20 25 30 

Ala Glu Lys Asp Asp Pro lie Ala lie Val Ala Met Ser Cys Arg Tyr 
35 40 45 

Pro Gly Gin Val Ser Ser Pro Glu Asp Leu Trp Gin Leu Ala Ala Gly 
50 55 60 

Gly Val Asp Ala lie Ser Glu Val Pro Gly Asp Arg Gly Trp Asp Leu 
65 70 75 80 

Asp Gly Val Phe Val Pro Asp Ser Asp Arg Pro Gly Thr Ser Tyr Ala 
85 90 95 

Cys Ala Gly Gly Phe Leu Gin Gly Val Ser Glu Phe Asp Ala Gly Phe 
100 105 110 

Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
115 ' 120 125 

Leu Leu Leu Glu Val Ala Trp Glu Val Phe Glu Arg Ala Gly Leu Glu 
130 135 140 

Gin Arg Ser Thr Arg Gly Ser Arg Val Gly Val Phe Val Gly Thr Asn 
145 150 155 160 

Gly Gin Asp Tyr Ala Ser Trp Leu Arg Thr Pro Pro Pro Ala Val Ala 
165 170 175 

Gly His Val Leu Thr Gly Gly Ala Ala Ala Val Leu Ser Gly Arg Val 
180 185 190 

Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala 
195 200 205 

Cys Ser Ser Ser Leu, Val Ala Leu His Leu Ala Gly Gin Ala Leu Arg 
210 215 220 

Ala Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ser 
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225 230 235 240 

Thr Pro Lys Val Phe Leu Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro 
245 250 255 

Asp Gly Arg Cys Lys Ser Phe Ala Ala Gly Ala Asp Gly Thr Gly Trp 
260 265 270 

Gly Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg 
275 280 285 

Arg Asn Gly His Glu Val Leu Ala Val Val Arg Gly Ser Ala Val Asn 
290 295 300 

Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Ser Ser Gin 
305 310 315 320 ■ 

Gin Arg Val lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser Val Ser 
325 330 335 

Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp 
340 345 . 350 

Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Arg Asp Arg Asp 
355 360 365 

Pro Gly Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn lie Gly His 
370 375 380 

Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Met Ala 
385 390 395 400 

Met Arg His Gly Gin Leu Pro Arg Thr Leu His Val Glu Ser Pro Ser 
405 410 415 

Pro Glu Val Asp Trp Ser Ala Gly Thr Val Gin Leu Leu Thr Glu Asn 
420 425 430 

Thr Pro Trp Pro Arg Ser Gly Arg Val Arg Arg Val Gly Val Ser Ser 
435 440 445 

Phe Gly He Ser Gly Thr Asn Ala His Val He Leu Glu Gin Pro Pro 
450 455 460 

Gly Val Pro Ser Gin Ser Ala Gly Pro Gly Ser Gly Ser Val Val Asp 
465 470 475 480 

Val Pro Val Val Pro Trp Met Val Ser Gly Lys Thr Pro Glu Ala Leu 
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485 



490 



495 



Ser Ala Gin Ala Thr Ala Leu Met Thr Tyr Leu Asp Glu Arg Pro Asp 
500 505 510 

Val Ser Ser Leu Asp Val Gly Tyr Ser Leu Ala Leu Thr Arg Ser Ala 
515 520 525 

Leu Asp Glu Arg Ala Val Val Leu Gly Ser Asp Arg Glu Thr Leu Leu 
530 535 540 

Cys Gly Val Lys Ala Leu Ser Ala Gly His Glu Ala Ser Gly Leu Val 
545 550 555 560 

Thr Gly Ser Val Gly Ala Gly Gly Arg He Gly Phe Val Phe Ser Gly 
565 570 575 

Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Gly Leu Tyr Arg Ala Phe 
580 585 590 

Pro Val Phe Ala Ala Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp Ala 
595 600 605 

His Leu Gly Gin Glu He Gly Val Arg Glu Val Val Ser Gly Ser Asp 
610 615 620 

Ala Gin Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser Gly Leu Phe Ala 
625 630 635 640 

Leu Gin Val Gly Leu Leu Lys Leu Leu Asp Ser Trp Gly Val Arg Pro 
645 650 655 

Ser Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala Phe Ala 
660 665 670 

Ala Gly Val Val Ser Leu Ser Gly Ala Ala Arg Leu Val Ala Gly Arg 
675 680 685 

Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Gly Met Leu Ala Val 
690 695 700 

Pro Ala Gly Glu Glu Leu Leu Trp Ser Leu Leu Ala Asp Gin Gly Asp 
705 710 715 720 

Arg Val Gly He Ala Ala Val Asn Ala Ala Gly Ser Val Val Leu Ser 
725 730 735 



Gly Asp Arg Asp Val Leu Asp Asp Leu Ala Gly Arg Leu Asp Gly Gin 
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740 



745 



750 



Gly He Arg Ser Arg Trp Leu Arg Val Ser His Ala Phe His Ser Tyr 
755 760 765 

Arg Met Asp Pro Met Leu Ala Glu Phe Ala Glu Leu Ala Arg Thr Val 
770 775 780 

Asp Tyr Arg Arg Cys Glu Val Pro He Val Ser Thr Leu Thr Gly Asp 
785 790 795 800 

Leu Asp Asp Ala Gly Arg Met Ser Gly Pro Asp Tyr Trp Val Arg Gin 
805 810 815 

Val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Gin Ala Leu Val Glu 
820 825 830 

His Asp Val Ala Thr Val Val Glu Leu Gly Pro Asp Gly Ala Leu Ser 
835 840 845 

Ala Leu He Gin Glu Cys Val Ala Ala Ser Asp His Ala Gly Arg Leu 
850 855 860 

Ser Ala Val Pro Ala Met Arg Arg Asn Gin Asp Glu Ala Gin Lys Val 
865 870 875 880 

Met Thr Ala Leu Ala His Val His Val Arg Gly Gly Ala Val Asp Trp 
885 890 895 

Arg Ser Phe Phe Ala Gly Thr Gly Ala Lys Gin He Glu Leu Pro Thr 
900 905 910 

Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Val Pro Ser Asp Ser Gly 
915 920 925 

Asp Val Thr Gly Ala Gly Leu Ala Gly Ala Glu His Pro Leu Leu Gly 
930 935 940 

Ala Val Val Pro Val Ala Gly Gly Asp Glu Val Leu Leu Thr Gly Arg 
945 950 955 960 

He Ser Val Arg Thr His Pro Trp Leu Ala Glu His Arg Val Leu Gly 
965 970 975 

Glu Val He Val Ala Gly Thr Ala Leu Leu Glu He Ala Leu His Ala 
980 985 990 



Gly Glu Arg Leu Gly Cys Glu Arg Val Glu Glu Leu Thr Leu Glu Ala 
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995 1000 1005 

Pro Leu Val Leu Pro Glu Arg Gly Ala lie Gin Val Gin Leu Arg Val 
1010 1015 1020 

Gly Ala Pro Glu Asn.Ser Gly Arg Arg Pro Met Ala Leu Tyr Ser Arg 
025 1030 1035 1040 

Pro Glu Gly Ala Ala Glu His Asp Trp Thr Arg His Ala Thr Gly Arg 
1045 1050 1055 

Leu Ala Pro Gly Arg Gly Glu Ala Ala Gly Asp Leu Ala Asp Trp Pro 
1060 1065 1070 

Ala Pro Gly Ala Leu Pro Val Asp Leu Asp Glu Phe Tyr Arg Asp Leu 
1075 1080 1085 

Ala Glu Leu Gly Leu Glu Tyr Gly Pro lie Phe Gin Gly Leu Lys Ala 
1090 1095 1100 

Ala Trp Arg Gin Gly Asp Glu Val Tyr Ala Glu Ala Ala Leu Pro Gly 
105 1110 1115 1120 

Thr Glu Asp Ser Gly Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala 
1125 1130 1135 

Leu His Ala Thr Ala Val Arg Asp Met Asp Asp Ala Arg Leu Pro Phe 
1140 1145 1150 

Gin Trp Glu Gly Val Ser Leu His Ala Lys Ala Ala Pro Ala Leu Arg 
1155 1160 1165 

Val Arg Val Val Pro Ala Gly Asp Asp Ala Lys Ser Leu Leu Val Cys 
1170 1175 1180 

Asp Gly Thr Gly Arg Pro Val He Ser Val Asp Arg Leu Val Leu Arg 
185 1190 1195 1200 

Ser Ala Ala Ala Arg Arg Thr Gly Ala Arg Arg Gin Ala His Glti Ala 
1205 1210 1215 

Arg Leu Tyr Arg Leu Ser Trp Pro Thr Val Gin Leu Pro Thr Ser Ala 
1220 1225 1230 

Gin Pro Pro Ser Cys Val Leu Leu Gly Thr Ser Glu Val Ser Ala Asp 
1235 1240 1245 

He Gin Val Tyr Pro Asp Leu Arg Ser Leu Thr Ala Ala Leu Asp Ala 
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1250 1255 1260 



Gly Ala Glu Pro Pro Gly Val Val lie Ala Pro Thr Pro Pro Gly Gly 
265 1270 1275 1280 

Gly Arg Thr Ala Asp Val Arg Glu Thr Thr Arg His Ala Leu Asp Leu 
1285 1290 1295 

Val Gin Gly Trp Leu Ser Asp Gin Arg Leu Asn Glu Ser Arg Leu Leu 
1300 1305 1310 

Leu Val Thr Gin Gly Ala Val Ala Val Glu Pro Gly Glu Pro Val Thr 
1315 1320 1325 

Asp Leu Ala Gin Ala Ala Leu Trp Gly Leu. Leu Arg Ser Thr Gin Thr 
1330 1335 1340 

Glu His Pro Asp Arg Phe Val Leu Val Asp Val Pro Glu Pro Ala Gin 
345 1350 1355 1360 

Leu Leu Pro Ala Leu Pro Gly Val Leu Ala Cys Gly Glu Pro Gin Leu 
1365 1370 1375 

Ala Leu Arg Arg Gly Gly Ala His Ala Pro Arg Leu Ala Gly Leu Gly 
1380 1385 1390 



Ser Asp Asp Val Leu Pro Val Pro Asp Gly Thr Gly Trp Arg Leu Glu 
1395 1400 1405 

Ala Thr Arg Pro Gly Ser Leu Asp Gly Leu Ala Leu Val Asp Glu Pro 
1410 1415 1420 

Thr Ala Thr Ala Pro Leu Gly Asp Gly Glu Val Arg He Ala Met Arg 
425 1430 1435 1440 

Ala Ala Gly Val Asn Phe Arg Asp Ala Leu He Ala Leu Gly Met Tyr 
1445 1450 1455 

Pro Gly Val Ala Ser Leu Gly Ser Glu Gly Ala Gly Val Val Val Glu 
1460 1465 1470 

Thr Gly Pro Gly Val Thr Gly Leu Ala Pro Gly Asp Arg Val Met Gly 
1475 1480 1485 

Met He Pro Lys Ala Phe Gly Pro Leu Ala Val Ala Asp His Arg Met 
1490 1495 1500 

Val Thr Arg He Pro Ala Gly Trp Ser Phe Ala Arg Ala Ala Ser Val 
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505 1510 1515 1520 

Pro lie Val Phe Leu Thr Ala Tyr Tyr Ala Leu Val Asp Leu Ala Gly 
1525 1530 1535 

Leii Arg Pro Gly Glu Ser Leu Leu Val His Ser Ala Ala Gly Gly Val 
1540 1545 1550 

Gly Met Ala Ala lie Gin Leu Ala Arg His Leu Gly Ala Glu Val Tyr 
1555 1560 1565 

Ala Thr Ala Ser Glu Asp Lys Trp Gin Ala Val Glu Leu Ser Arg Glu 
1570 1575 1580 

His Leu Ala Ser Ser Arg Thr Cys Asp Phe Glu Gin Gin Phe Leu Gly 
585 1590 1595 1600 

{ Ala Thr Gly Gly Arg Gly Val Asp Val Val Leu Asn Ser Leu Ala Gly 
1605 1610 1615 

Glu Phe Ala Asp Ala Ser Leu Arg Met Leu Pro Arg Gly Gly Arg Phe 
1620 1625 1630 

Leu Glu Leu Gly Lys Thr Asp Val Arg Asp Pro Val Glu Val Ala Asp 
1635 1640 1645 

Ala His Pro Gly Val Ser Tyr Gin Ala Phe Asp Thr Val Glu Ala Gly 
1650 1655 1660 

Pro Gin Arg He Gly Glu Met Leu His Glu Leu Val Glu Leu Phe Glu 
665 1670 1675 1680 

Gly Arg Val Leu Glu Pro Leu Pro Val Thr Ala Trp Asp Val Arg Gin 
1685 1690 1695 

Ala Pro Glu Ala Leu Arg His Leu Ser Gin Ala Arg His Val Gly Lys 
1700 1705 1710 

Leu Val Leu Thr Met Pro Pro Val Trp Asp Ala Ala Gly Thr Val Leu 
1715 1720 1725 

Val Thr Gly Gly Thr Gly Ala Leu Gly Ala Glu Val Ala Arg His Leu 
1730 1735 1740 

Val He Glu Arg Gly Val Arg Asn Leu Val Leu Val Ser Arg Arg Gly 
745 1750 1755 1760 

Pro Ala Ala Ser Gly Ala Ala Glu Leu Val Ala Gin Leu Thr Ala Tyr 
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1765 1770 1775 

Gly Ala Glu Val Ser Leu Gin Ala Cys Asp Val Ala Asp Arg Glu Thr 
1780 1785 1 1790 

Leu Ala Lys Val Leu Ala Ser He Pro Asp Glu His Pro Leu Thr Ala 
1795 1800 1805 

Val Val His Ala Ala Gly Val Leu Asp Asp Gly Val Ser Glu Ser Leu 
1810 1815 1820 

Thr Val Glu Arg Leu Asp Gin Val Leu Arg Pro Lys Val Asp Gly Ala 
825 1830 1835 1840 

Arg Asn Leu Leu Glu Leu He Asp Pro Asp Val Ala Leu Val Leu Phe 
1845 . 1850 1855 

Ser Ser Val Ser Gly Val Leu Gly Ser Gly Gly Gin Gly Asn Tyr Ala 
1860 1865 1870 

Ala Ala Asn Ser Phe Leu Asp Ala Leu Ala Gin Gin Arg Gin Ser Arg 
1875 1880 1885 

Gly Leu Pro Thr Arg Ser Leu Ala Trp Gly Pro Trp Ala Glu His Gly 
1890 1895 1900 

Met Ala Ser Thr Leu Arg Glu Ala Glu Gin Asp Arg Leu Ala Arg Ser 
905 1910 1915 1920 

Gly Leu Leu Pro He Ser Thr Glu Glu Gly Leu Ser Gin Phe Asp Ala 
1925 1930 1935 

Ala Cys Gly Gly Ala His Thr Val Val Ala Pro Val Arg Phe Ser Arg 
1940 1945 1950 

Leu Ser Asp Gly Asn Ala He Lys Phe Ser Val Leu Gin Gly Leu Val 
1955 I960 1965 

Gly Pro His Arg Val Asn Lys Ala Ala Thr Ala Asp Asp Ala Glu Ser 
1970 1975 1980 

Leu Arg Lys Arg Leu Gly Arg Leu Pro Asp Ala Glu Gin His Arg He 
985 1990 1995 2000 

Leu Leu Asp Leu Val Arg Met His Val Ala Ala Val Leu Gly Phe Ala 
2005 2010 2015 

Gly Ser Gin Glu He Thr Ala Asp Gly Thr Phe Lys Val Leu Gly Phe 
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2025 2030 



Asp Ser Leu Thr Val Val Glu Leu Arg Asn Arg lie Asn Gly Ala Thr 
2035 2040 2045' 

Gly Leu Arg Leu Pro Ala Thr Leu Val Phe Asn Tyr Pro Thr Pro Asp 
2050 2055 2060 

Ala Leu Ala Ala His Leu Val Thr Ala Leu Ser Ala Asp Arg Leu Ala 
065 2070 2075 2080 

Gly Thr Phe Glu Glu Leu Asp Arg Trp Ala Ala Asn Leu Pro Thr Leu 
20B5 2090 2095 

Ala Arg Asp Glu Ala Thr Arg Ala Gin lie Thr Thr Arg Leu Gin Ala 
2100 2105 2110 

lie Leu Gin Ser Leu Ala Asp Val Ser Gly Gly Thr Gly Gly Gly Ser 
2115 2120 2125 

Val Pro Asp Arg Leu Arg Ser Ala Thr Asp. Asp Glu Leu Phe Gin Leu 
2130 2135 2140 

Leu Asp Asn Asp Leu Glu Leu Pro 
145 2150 



<210> 45 
<211> 9513 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (9510) 

<223> ORF20; Polyketidsynthase 

<400> 45 

atg teg aat gaa gag aag etc egg gag tac ttg egg cgt gcg etc gtg 48 
Met Ser Asn Glu Glu Lys Leu Arg Glu Tyr Leu Arg Arg Ala Leu Val 
1 5 10 15 



gat ctg cac cag gcg cgc gag egg ctg cac gag gcg gag teg gga gag 96 
Asp Leu His Gin Ala Arg Glu Arg Leu His Glu Ala Glu Ser Gly Glu 
20 25 30 

egg gaa ccc ate gcg ate gtg gcg atg ggc tgc egg tac ccg ggt ggg 14 4 
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Arg Glu Pro lie Ala lie Val Ala Met Gly Cys Arg Tyr Pro Gly Gly 
35 40 45 



gtg cag gac ccg gaa gg^ ctg tgg aaa ctg gtc gcc tec ggt ggc gac 192 
Val Gin Asp Pro Glu Gly Leu Trp Lys Leu Val Ala Ser Gly Gly Asp 
50 55 60 

gcc ate ggt gaa ttc ccc get gat cgt ggt tgg cac etc gac gag etc 240 
Ala lie Gly Glu Phe Pro Ala Asp Arg Gly Trp His Leu Asp Glu Leu 
65 70 75 80 

tac gat ccc gac ccg gat cag ccc gga ace tgc tac ace egg cac ggc 288 
Tyr Asp Pro Asp Pro Asp Gin Pro Gly Thr Cys Tyr Thr Arg His Gly 
85 90 95 



ggc ttc etc cac gac gcc ggc gag 
Gly Phe Leu His Asp Ala Gly Glu 
100 

age ccc cgt gag gcg etc gcg atg 
Ser Pro Arg Glu Ala Leu Ala Met 
115 120 



ttc gac gcg gga ttc ttc gac ate 336 
Phe Asp Ala Gly Phe Phe Asp He 
105 110 

gac ccg cag cag egg ctg ctg ctg 384 
Asp Pro Gin Gin Arg Leu Leu Leu 

12.5 



gaa ate tec tgg gag acc gtc gaa tec get ggg atg gac ccg agg tec 432 
Glu He Ser Trp Glu Thr Val Glu Ser Ala Gly Met Asp Pro Arg Ser 
130 135 140 

ttg egg ggg age cgc acc ggg gtg ttc gcg gga ttg atg tac gag ggc 480 
Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly. Leu Met Tyr Glu Gly 
145 150 155 . 160 

tat gac acc ggc gcc cac egg gca gga gaa ggt gtc gaa ggc tat etc 528 
Tyr Asp Thr Gly Ala His Arg Ala Gly Glu Gly Val Glu Gly Tyr Leu 
165 170 ' 175 



gga acc ggc aat gcg gga age gtc gcc tct ggt egg gtt gcg tat gcg 576 
Gly Thr Gly Asn Ala Gly Ser Val Ala Ser Gly Arg Val Ala Tyr Ala 
. 180 185 190 

ttc ggg ttc gag ggc cca gcg gtg acg gta gac acg gcg tgc teg teg 624 
Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser 
195 200 205 



teg ttg gtg gcg ctg cat ttg gcg tgt cag teg ttg egg cag ggc gag 672 
Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Gin Gly Glu 
210 215 220 



tgt gat ctg gcg ctg gcc ggt gga gtg acg gtg atg teg acg ccg gag 72 0 
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Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ser Thr Pro Glu 
225 230 235 240 

agg ttc gtg gag ttc tec cgt cag cgt ggt etc gca ccg gat ggg egg 768 
Arg Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp' Gly Arg 
245 250 255 

tgt aag teg ttc gcg gcg get gcg gat gga ace ggt tgg ggt gag ggt 816 
Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 

gee ggt ttg gtg ttg ctg gag egg ctg tea gac gee agg egg aac ggg 864 
Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 
275 280 285 



cat egg gta ctg gcg gtt gtt .cgt ggt 
His Arg Val Leu Ala Val Val Arg Gly 
290 295 

gcg teg aac gga ttg acg gee ccg aac 
Ala Ser Asn Gly Leu Thr Ala Pro Asn 
305 310 

att cag cag gtg etc acg agt gcg ggg 
He Gin Gin Val^Leu Thr Ser Ala Gly 
325 



age gcg gtg aat cag gac ggt 912 
Ser Ala Val Asn Gin Asp Gly 
300 

ggg ctg gee cag gag egg gtc 960 
Gly Leu Ala Gin Glu Arg Val 
315 320 

ctg teg gcg tec gat gtg gac 1008 
Leu Ser Ala Ser Asp Val Asp ' 
330 335' 



get gtg gag gcg cat gga acg ggt acg egg ctt ggt gat ccg ate gag 1056 
Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu 
340 345 350 

gcg cag get ctg ata gee gee tat gga cag gat egg gac egg gac egg 1104 
Ala Gin Ala Leu He Ala Ala Tyr Gly Gin Asp Arg Asp Arg Asp Arg 
355 360 365 

ccg ctg tgg ttg ggg teg gtc aag tec aac ate ggt cat acg cag gcg 1152 
Pro Leu Trp Leu Gly Ser Val Lys Ser Asn He Gly His Thr Gin Ala 
370 375 380 

get gcg ggc gtc get ggt gtg ate aag atg gtc atg gcg atg egg cac 1200 
Ala Ala Gly Val Ala Gly Val He Lys Met Val Met Ala Met Arg His 
385 390 395 400 

ggg gag ctg ccg cgc acg ttg cac gtg gac gag ccg aat teg cac gtg 124 8 
Gly Glu Leu Pro Arg Thr Leu His Val Asp Glu Pro Asn Ser His Val 
405 410 415 

gac tgg teg get ggt gcg gtc cga etc ctg ace gag aac ate cgc tgg 1296 
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Asp Trp Ser Ala Gly Ala Val Arg Leu Leu Thr Glu Asn He Arg Trp 
420 425 430 

cca ggg acg ggt acg cgc cgc get gga gtg teg teg ttc ggg gta. age 1344 
Pro Gly Thr Gly Thr Arg Arg Ala Gly Val Ser Ser Phe Gly Val" Ser 
435 440 445 

ggt ace aac gca cac gtc ate etc gaa cac gac ccg etc gee gtg ace 1392 
Gly Thr Asn Ala His Val He Leu Glu His Asp Pro Leu Ala Val Thr 
450 455 460 

gag aac gag gaa gca gcg cag tec cca gca cct ggg ate gtg ccc tgg 1440 
Glu Asn Glu Glu Ala Ala Gin Ser Pro Ala Pro Gly He Val Pro Trp 
465 470 475 480 

gcg ttg tec ggg egg teg teg acg gcg ctg egg gee cag gee gaa egg 14 88 
Ala Leu Ser Gly Arg Ser Ser Thr Ala Leu Arg Ala Gin Ala Glu Arg 
485 490 495 

ctg cgc gag ctg tgc gag cag ace gat ccc gac ccc gtc gat gtc ggt 1536 
Leu Arg Glu Leu Cys Glu Gin Thr Asp Pro Asp Pro Val Asp Val Gly 
500 505 510 

ttc tea ctg gee gee acg cgc acg get tgg gag cac cga gcg gtg gtg 1584 
Phe Ser Leu Ala Ala Thr Arg Thr Ala Trp Glu His Arg Ala Val Val 
515 520 525 

ctt ggt egg gac age get acg ttg cgc tec ggg ctt ggc gtt gtt gee 1632 
Leu Gly Arg Asp Ser Ala Thr Leu Arg Ser Gly Leu Gly Val Val Ala 
530 535 540 

age ggt gaa cca gcg gtc gat gtc gtt gag ggg age gtc ctg gac ggc 1680 
Ser Gly Glu Pro Ala Val Asp Val Val Glu Gly Ser Val Leu Asp Gly 
545 550 555 560 

gag gtc gtc ttc gtc ttc ccc ggt cag ggc tgg cag tgg gee ggt atg 172 8 
Glu Val Val Phe Val Phe Pro Gly Gin Gly Trp Gin Trp Ala Gly Met 
565 570 575 

gca gtc gac ctg ctg gac get teg ccg acg ttc gcg cgc cac atg gac 1776 
Ala Val Asp Leu Leu Asp Ala Ser Pro Thr Phe Ala Arg His Met Asp 
580 585 590 

gag tgc gee ace gcg ctg egg agg tac gtg gac tgg teg ttg gtc gac 1824 
Glu Cys Ala Thr Ala Leu Arg Arg Tyr Val Asp Trp Ser Leu Val Asp 
595 600 605 

gtg ctg cgc gga gcg gag aac tec cca ccg ctg gac egg gtg gac gtg 1872 
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Val Leu Arg Gly Ala Glu Asn Ser Pro Pro Leu Asp Arg Val Asp Val 
610 615 620 

etc cag ccc gcg tec ttc gcg gtg atg gtg teg etc gee gag gtg tgg 1920 
Leu Gin Pro Ala Ser Phe Ala Val Met Val Ser Leu Ala Glu Val Trp 
625 630 635 640 

cgt tec tac ggg gtg agg ccg gcg gee gtc gtc ggc cac agt caa ggc 1968 
Arg Ser Tyr Gly Val Arg Pro Ala Ala Val Val Gly His Ser Gin Gly 
645 650 655 

gaa ate gee gcg gee tgc gca gee ggg gtg ctg ccg ctg gag gat gcg 2016 
Glu lie Ala Ala Ala Cys Ala Ala Gly Val Leu Pro Leu Glu Asp Ala 
660 665 670 

gee agg ctt gtc gca ttg cgc age aga gcg ttg aag gga ctt teg ggg 2064 
Ala Arg Leu Val Ala Leu Arg Ser Arg Ala Leu Lys Gly Leu Ser Gly 
675 680 685 

egg ggt ggc atg gcg teg ctg gee tgc cct gcg gat gag gtc gcg gca 2112 
Arg Gly Gly Met Ala Ser Leu Ala Cys Pro Ala Asp Glu Val Ala Ala 
690 695 700 

ttg ttc gcg gga teg ggc ggc cgt ctg gaa gtt gcg gcg ate aac ggc 2160 
Leu Phe Ala Gly Ser Gly Gly Arg Leu Glu Val Ala Ala He Asn Gly 
705 710 715 720 

ccg cga teg gtc gtg gtg tec ggc gat ctg gaa gcg gtg gac gaa ctg 2208 
Pro Arg Ser Val Val Val Ser Gly Asp Leu Glu Ala Val Asp Glu Leu 
725 730 735 

ctg gca gag tgc get gaa aag gac atg cgt gca cgc cgt ate ccc gtc 2256 
Leu Ala Glu Cys Ala Glu Lys Asp Met Arg Ala Arg Arg He Pro Val 
740 745 750 

gac tac gee teg cat tea gcg cac gtg gag gtg gtt egg age ccg gtg 2304 
Asp Tyr Ala Ser His Ser Ala His Val Glu Val Val Arg Ser Pro Val 
755 760 765 

ctg gcg gee gee gee ggg gtg cga cac egg gac ggc cag gtg ccg tgg 2352 
Leu Ala Ala Ala Ala Gly Val Arg His Arg Asp Gly Gin Val Pro Trp 
770 775 780 

tgg teg acg gtg ate ggc gac tgg gtg gat ccg gee agg ctg gac ggc 2400 
Trp Ser Thr Val He Gly Asp Trp Val Asp Pro Ala Arg Leu Asp Gly 
785 790 795 800 

gag tat tgg tat egg aac etc egg cag ccg gtc egg ttc gaa cac gee 244 8 
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Glu Tyr Trp Tyr Arg Asn Leu Arg Gin Pro Val Arg Phe Glu His Ala 
805 810 815 

gtg cag ggc ctg gtc gag egg gga ttc ggc ctg ttc ate gaa atg agt 2496 

Val Gin Gly Leu Val Glu Arg Gly Phe Gly Leu Phe He Glu Met Ser 
820 825 830 



gcg cat ccg gtg ctg ace acg gcg gtc gag gaa ace ggt gcg gag teg 2544 
Ala His Pro Val Leu Thr Thr Ala Val Glu Glu Thr Gly Ala Glu Ser 
835 840 845 

gag ace gee gtg gee gcg gta ggt ace ttg cga cgt gac teg ggc ggc 2592 
Glu Thr Ala Val Ala Ala Val Gly Thr Leu Arg Arg Asp Ser Gly Gly 
850 855 860 

etc egg agg ttg ttg cat teg ctg gee gag gcg tac gtg cgc ggc gee 2640 
Leu Arg Arg Leu Leu His Ser Leu Ala Glu Ala Tyr Val Arg Gly Ala 
865 870 875 880 

ace gtg gac tgg gee gtg gcg ttc ggg ggc gcg ggc cga egg ctg gac 2688 
Thr Val Asp Trp Ala Val Ala Phe Gly Gly Ala Gly Arg Arg Leu Asp 
885 890 895 

ctg ccg acc tac ccg ttc cag cgc cag egg tac tgg ctg gac aag gga 2736 
Leu Pro Thr Tyr Pro Phe Gin Arg Gin Arg Tyr Trp Leu Asp Lys Gly 
900 905 910 



get gee tec gac gag get cgt gcg gtc teg gac ccg gcg gcg ggc tgg 2784 
Ala Ala Ser Asp Glu Ala Arg Ala Val Ser Asp Pro Ala Ala Gly Trp 
915 920 925 

ttc tgg caa gec gtg gcg cgc caa gac ctg aaa age gtg tec gat gec 2832 
Phe Trp Gin Ala Val Ala Arg Gin Asp Leu Lys Ser Val Ser Asp Ala 
930 935 940 



etc gat etc gac gee gac gca ccg ctg age gca aca ctt cca gec ctg 2880 
Leu Asp Leu Asp Ala Asp Ala Pro Leu Ser Ala Thr Leu Pro Ala Leu 
945 950 955 960 



tec gtc tgg cac cgt cag gaa cga gaa agg gtc ttg gca gac ggt tgg 2 928 
Ser Val Trp His Arg Gin Glu Arg Glu Arg Val Leu Ala Asp Gly Trp 
965 970 975 



egg tac cga gtc gac tgg gta egg gtg gec ccg cag ccg gtc egg aga 
Arg Tyr Arg Val Asp Trp Val Arg Val Ala Pro Gin Pro Val Arg Arg 
980 985 990 



2976 



acg egg gaa acc tgg etc ctg gtc gtt ccc ccg ggc ggc ate gag gaa 3024 
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Thr Arg Glu Thr Trp Leu Leu Val Val Pro Pro Gly Gly lie Glu Glu 
995 1000 1005 

gcg ctg gtc gaa egg ctg' acg gat gcg ttg aac acg cga ggg ate age 3072 
Ala Leu Val Glu Arg Leu Thr Asp Ala Leu Asn Thr Arg Gly lie Ser 
1010 1015 1020 

ace ctg cgc etc gac gtg cca ccg gcg gcg acc agt ggc gaa etc gca 3120 
Thr Leu Arg Leu Asp Val Pro Pro Ala Ala Thr Ser Gly Glu Leu Ala . 
1025 1030 1035 1040 

acc gaa etc cgc gee gca gec gac ggt gac ccg gtg aag gca ate ctg 3168 
Thr Glu Leu Arg Ala Ala Ala Asp Gly Asp Pro Val Lys Ala He Leu 
1045 1050 1055 

teg etc acc gcg ttg gac gag cga ccc cac ccc gaa tgc aag gac gtc 3216 
Ser Leu Thr Ala Leu Asp Glu Arg Pro His Pro Glu Cys Lys Asp Val 
1060 1065 1070 

ccg age ggg att gee ttg ctg ctg aac ctg gtc aag gcg etc ggt gaa 3264 
Pro Ser Gly He Ala Leu Leu Leu Asn Leu Val Lys Ala Leu Gly Glu 
1075 1080 1085 

gee gac etc aga att cct ctg tgg acc ate acg cgt ggt gcg gtc aag 3312 
Ala Asp Leu Arg He Pro Leu Trp Thr He Thr Arg Gly Ala Val Lys 
1090 1095 1100 

gca ggc ccc gca gat egg ctg ctg cgc ccg atg cag gcg caa gca tgg 3360 
Ala Gly Pro Ala Asp Arg Leu Leu Arg Pro Met Gin Ala Gin Ala Trp 
1105 1110 1115 1120 

ggt ctg ggg cga gta gee gca etc gaa cac ccc gag cgc tgg ggt ggg 3408 
Gly Leu Gly Arg Val Ala Ala Leu Glu His Pro Glu Arg Trp Gly Gly 
1125 1130 1135 

ctg ate gac ctg ccg gat teg ctg gac ggc gac gtc etc acg agg ctg 3456 
Leu He Asp Leu Pro Asp Ser Leu Asp Gly Asp Val Leu Thr Arg Leu 
1140 1145 1150 

ggc gaa gcg etc acc aac ggc ttg gcg gaa gac caa ctg gcg att cgc 3504 
Gly Glu Ala Leu Thr Asn Gly Leu Ala Glu Asp Gin Leu Ala He Arg 
1155 1160 1165 

cag teg ggc gtg ctg gee egg cga ctg gta ccc gee ccg gcg aat cag 3552 
Gin Ser Gly Val Leu Ala Arg Arg Leu Val Pro Ala Pro Ala Asn Gin 
1170 1175 1180 

ccc get gga cgt aag tgg cgc ccc cga ggg age gcg ctg ate acg ggc 3600 
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Pro Ala Gly Arg Lys Trp Arg Pro Arg Gly Ser Ala Leu He Thr Gly 
H85 1190 1195 1200 

gga etc ggc gcg gtg ggc gca cag gtg gcg agg tgg ttg gec gaa ate 364 8 
Gly Leu Gly Ala Val Gly Ala Gin Val Ala Arg Trp Leu Ala Glu He 
1205 1210 1215 

gga gec gag cga ate gtg etc ace agt cga egg ggc aac caa gca gca 3696 
Gly Ala Glu Arg He Val Leu Thr Ser Arg Arg Gly Asn Gin Ala Ala 
1220 1225 1230 

ggc gee gee gag ctg gaa gec gaa etc egg gec ctt gga gcg caa gtg 374 4 
Gly Ala Ala Glu Leu Glu Ala Glu Leu Arg Ala Leu Gly Ala Gin Val 
1235 1240 1245 

tec ate gtg get tgc gac gtg acc gat cgt gee gag "atg tec gca eta 3792 
Ser He Val Ala Cys Asp Val Thr Asp Arg Ala Glu Met Ser Ala Leu 
1250 1255 1260 

ctg gee gag ttc gac gtc acc gcg gtg ttc cac gcg gee gga gtc ggt 3 84 0 
Leu Ala Glu Phe Asp Val Thr Ala Val Phe His Ala Ala Gly Val Gly 
1265 1270 1275 1280 

egg ctg ctg ccg ttg gcg gag acc gac cag aac ggc ctg gee gaa ata 3888 
Arg Leu Leu Pro Leu Ala Glu Thr Asp Gin Asn Gly Leu Ala Glu He 
1285 1290 1295 

tgc gcg gcg aag gtc cgc ggc get cag gtg ctg gac gaa ctg tgc gac 3936 
Cys Ala Ala Lys Val Arg Gly Ala Gin Val Leu Asp Glu Leu Cys Asp 
1300 1305 1310 

age acc gat etc gat gec ttc gtc ctg ttc tec teg ggt gec ggg gta 3 984 
Ser Thr Asp Leu Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Gly Val 
1315 1320 1325 

tgg ggc ggg ggc ggt cag ggc get tac ggc gcg gcg aac gca ttc ttg 4032 
Trp Gly Gly Gly Gly Gin Gly Ala Tyr Gly Ala Ala Asn Ala Phe Leu 
1330 1335 1340 

gac aca etc gec gaa caa cgc cga gca cgc ggt ctg ccg gca acc teg 4080 
Asp Thr Leu Ala Glu Gin Arg Arg Ala Arg Gly Leu Pro Ala Thr Ser 
1345 1350 1355 1360 

ate tec tgg ggc agt tgg gee ggc ggc ggc atg gee gac ggc gcg gcg 4128 
He Ser Trp Gly Ser Trp Ala Gly Gly Gly Met Ala Asp Gly Ala Ala 
1365 1370 1375 

ggc gaa cac ctg egg cga cgc ggg ata cgt ccg atg ccg gcg gcg teg 4176 
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Gly Glu His Leu Arg Arg Arg Gly He Arg Pro Met Pro Ala Ala Ser 
1380 1385 1390 



gcc ate ctg get ctg cag gaa gta ctt gac cag gat gag acg tgc gtg 
Ala He Leu Ala Leu Gin Glu Val Leu Asp Gin Asp Glu Thr Cys Val 
1395 1400 1405 



4224 



teg ate get gat gtg gac tgg gac cga ttc gtt ccc acg ttc gcc gcg 
Ser He Ala Asp Val Asp Trp Asp Arg Phe Val Pro Thr Phe Ala Ala 
1410 1415 1420 



4272 



act cgc gcc acc egg ttg ttc gac gaa gtg ccg gcg gcg aga aag gcg 
Thr Arg Ala Thr Arg Leu Phe Asp Glu Val Pro Ala Ala Arg Lys Ala 
1425 1430 1435 1440 



4320 



atg ccc gcg aat ggg ccg gca gaa cca ggc ggc teg ccg ttc gcc cgc 
Met Pro Ala Asn Gly Pro Ala Glu Pro Gly Gly Ser Pro Phe Ala Arg 
1445 1450 1455 



4368 



aat etc gcg gag ctg ccg gaa gcc caa cga cgc cac gaa ctg gtg gat 
Asn Leu Ala Glu Leu Pro Glu Ala Gin Arg Arg His Glu Leu Val Asp 
1460 1465 1470 



4416 



ctg gtg tgc gcc cag gtg gca acc gtg etc ggg cac ggc agt cgc gag 
Leu Val Cys Ala Gin Val Ala Thr Val Leu Gly His Gly Ser Arg Glu 
1475 1480 1485 



4464 



gaa gtc cag ccc gag egg gcg ttc cgc gcg etc ggg ttc gac tec etc 
Glu Val Gin Pro Glu Arg Ala Phe Arg Ala Leu Gly Phe Asp Ser Leu 
1490 1495 1500 



4512 



atg gcg gtg gat ctg cgc aat cgt ttg acc acc gcc acc ggg ttg cgc 
Met Ala Val Asp Leu Arg Asn Arg Leu Thr Thr Ala Thr Gly Leu Arg 
1505 1510 1515 1520 



4560 



ctg ccg acc aca acc gtc ttc gac tac ccg aat ccg gcc gcc ttg gcc 
Leu Pro Thr Thr Thr Val Phe Asp Tyr Pro Asn Pro Ala Ala Leu Ala 
1525 1530 1535 



4608 



get cac ctg etc gag gag ctg gtg ggt gat gtc gcg teg get gcg gtg 4656 
Ala His Leu Leu Glu Glu Leu Val Gly Asp Val Ala Ser Ala Ala Val 
1540 1545 1550 

acc get gcc age gcg ccc gcg agt gac gaa ccg ate gcg ate gtc gcg 4704 
Thr Ala Ala Ser Ala Pro Ala Ser Asp Glu Pro He Ala He Val Ala 
1555 1560 1565 



atg age tgc egg ttt ccg ggt ggc gcg cac teg ccg gaa gac ctg tgg 4752 
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Met Ser Cys Arg Phe Pro Gly Gly Ala His Ser Pro Glu Asp Leu Trp 
1570 1575 1580 



egg ctg gtc gec gec ggc acg gag gtg ate ggc gag ttc ccc tec gac 
Arg Leu Val Ala Ala Gly Thr Glu Val lie Gly Glu Phe Pro Ser Asp 
1585 1590 1595 1600 



4800 



egg ggc tgg gat gcg gaa ggc ctt tac gat ccg gat get tec agg cct 
Arg Gly Trp Asp Ala Glu Gly Leu Tyr Asp Pro Asp Ala Ser Arg Pro 
1605 1610 1615 



4848 



gga acg acg tat gcg egg atg gcg gga ttc etc tac gac gee ggt gag 
Gly Thr Thr Tyr Ala Arg Met Ala Gly Phe Leu Tyr Asp Ala Gly Glu 
1620 1625 1630 



4896 



ttc gat gec gac ctg ttc ggc ate age cca cgt gag gcg ttg gcg atg 4 944 
Phe Asp Ala Asp .Leu Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met 
1635 1640 1645 

gat ccg cag cag egg ttg gtg etc gaa ate gee tgg gaa gee etc gaa 4992 
Asp Pro Gin Gin Arg Leu Val Leu Glu He Ala Trp Glu Ala Leu Glu 
1650 1655 1660 

egg gec gga ate gat ccg ttg tec ttg aag ggc agt ggg gtc ggc acg 504 0 
Arg Ala Gly He Asp Pro Leu Ser Leu Lys Gly Ser Gly Val Gly Thr 
1665 1670 1675 1680 

tac ate ggc get gga age cgt ggg tac gcg acg gat gtg egg cag ttt 5088 
Tyr He Gly Ala Gly Ser Arg Gly Tyr Ala Thr Asp Val Arg Gin Phe 
1685 1690 1695 

ccc gag gag gcg gag ggc tac ctg ctg acg ggt ace teg gec agt gtg 5136 
Pro Glu Glu Ala Glu Gly Tyr Leu Leu Thr Gly Thr Ser Ala Ser Val 
1700 1705 1710 

ctg teg ggt egg gtc gcg tat teg ttt ggt ttc gag ggt cct gcg gtg 5184 
Leu Ser Gly Arg Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val 
1715 1720 1725 

acg gtg gat acg get tgt teg teg teg ttg gtg gcg ttg cat ctg gcg 5232 
Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala 
1730 1735 1740 

tgc cag teg ttg cgt teg ggc gag tgt gat ctg gcg ttg gee ggt ggt 52 80 
Cys Gin Ser Leu Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly 
1745 1750 1755 1760 

gtg ace gtg atg teg acg ccg gag atg ttc gtg gag ttc tec cgt cag 532 8 
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Val Thr Val Met Ser Thr Pro Glu Met Phe Val Glu Phe Ser Arg Gin 
1765 1770 1775 . 



cgc ggt ttg gcg ccg gat ggg egg tgc aag teg ttc gcg gag age gcg 
Arg Gly Leu -Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ser Ala 
1780 1785 1790 



5376 



gac ggc ace ggc tgg ggc gaa ggc gcg ggc ctg ttg ttg ctg gag egg 
Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg 
1795 1800 1805 



5424 



ttg teg gac gee cac egg aat ggg cat egg gtg ttg gcg gtg gtt cgt 
Leu Ser Asp Ala His Arg Asn Gly His Arg Val Leu Ala Val Val Arg 
1810 1815 1820 



5472 



ggg. tea gcg gtg aat cag gac ggc gee teg aac gga ctg gcg gcg ccg 
Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Ala Ala Pro 
1825 1830 1835 1840 



5520 



aac ggt ccg teg cag cag egg gtg ate aac cag gca etc gcg aat gcg 
Asn Gly Pro Ser Gin Gin Arg Val He Asn Gin Ala Leu Ala Asn Ala 
1845 1850 1855 



5568 



get ctt teg gcg tec gat gtg gat gcg gtg gag gca cat ggc ace ggg 
Ala Leu Ser Ala Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly 
1860 1865 1870 



5616 



ace agg ctg ggt gat ccg ate gag gcg cag gca ttg ate gca acg tat 
Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Leu He Ala Thr Tyr 
1875 1880 1885 



5664 



ggg cag gee egg gag egg gat egg ccc ttg tgg ctg ggg teg gtc aag 
Gly Gin Ala Arg Glu Arg Asp Arg Pro Leu Trp Leu Gly Ser Val Lys 
1890 1895 1900 



5712 



teg aac ate ggt cat acg cag gee gcg gcg ggt gtt gee ggt gtg ate 
Ser Asn He Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He 
1905 1910 1915 1920 



5760 



aag atg gtg atg gee atg egg cac ggg cag ctg ccc gee teg ctg cac 
Lys Met Val Met Ala Met Arg His Gly Gin Leu Pro Ala Ser Leu His 
1925 1930 1935 



5808 



gcg gat gag ccc acg teg gag gtc gat tgg teg teg ggg gcg gtc egg 
Ala Asp Glu Pro Thr Ser Glu Val Asp Trp Ser Ser Gly Ala Val Arg 
1940 1945 1950 



5856 



etc etc gee gaa cag gta cct tgg ccg gag tct gac cgt gtt cgt egg 5904 
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Leu Leu Ala Glu Gin Val Pro Trp Pro Glu Ser Asp Arg Val Arg Arg 
1955 1960 1965 

9 fc 9 999 gtt teg teg ttc ggg ate age ggc ace aac gca cat gtg ate 5952 

Val Gly Val Ser Ser Phe Gly lie Ser Gly Thr Asn Ala His Val He 
1970 1975 1980 



etc gaa caa get acg aat gcg cca 
Leu Glu Gin- Ala Thr Asn Ala Pro 
1985 1990 

aca gaa tec gga tct act gtc gat 
Thr Glu Ser Gly Ser Thr Val Asp 
2005 



gat agt aca gcg gag acg gac aaa 6000 
Asp Ser Thr Ala Glu Thr Asp Lys 
1995 2000 

att ccg gtc gtt ccc tgg ttg gtg 6048 
He Pro Val Val Pro Trp Leu Val 
2010 2015 



teg gga aag acg acg gat tec ctg egg gga caa gee gaa cga gtc ttg 6096 
Ser Gly Lys Thr Thr Asp Ser Leu Arg Gly Gin Ala Glu Arg Val Leu 
2020 2025 2030 

tct cag gtc gag tec egg ccg gag cag cgt teg ctg gat gtt gee tac 6144 
Ser Gin Val Glu Ser Arg Pro Glu Gin Arg Ser Leu Asp Val Ala Tyr 
2035 2040 2045 



teg ctt get tct ggc cga gee gcg ctg gat gaa cgc get gtc gtg ctg 6192 
Ser Leu Ala Ser Gly Arg Ala Ala Leu Asp Glu Arg Ala Val Val Leu 
2050 2055 2060 



ggf gcg gac cgc ggt gag ctg gtt 
Gly Ala Asp Arg Gly Glu Leu Val 
2065 2070 

ggt cag gag get tct ggg gtg ate 
Gly Gin Glu Ala Ser Gly Val He 
2085 



get gga ctg gcg gcg ttg gee gee 6240 
Ala Gly Leu Ala Ala Leu Ala Ala 
2075 2080 

age gga act cgt get tct get egg 6288 
Ser Gly Thr Arg Ala Ser Ala Arg 
2090 2095 



ttc ggg ttc gtg ttc teg ggg cag ggt ggt cag tgg ttg ggg atg ggc 6336 
Phe Gly Phe Val Phe Ser Gly Gin Gly Gly Gin Trp Leu Gly Met Gly 
2100 2105 2110 



aga gcg etc tac teg aag ttt ccg gtg ttc get get gcg ttt gat gag 6384 
Arg Ala Leu Tyr Ser Lys Phe Pro Val Phe Ala Ala Ala Phe Asp Glu 
2115 2120 2125 

get tgc gee gag ttg gag gca cat ctg ggg gaa gac cgc egg gtt egg 6432 
Ala Cys Ala Glu Leu Glu Ala His Leu Gly Glu Asp Arg Arg Val Arg 
2130 2135 2140 



gat gtg gtc ttc ggt tec gat gcg cag ctg ctg gat cag acg ctg tgg 6480 
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Asp Val Val Phe Gly Ser Asp Ala Gin Leu Leu Asp Gin Thr Leu Trp 
2145 2150 2155 2160 



gcg cag teg ggt ctg ttc gcg ctg caa gec ggc etc ttg ggg ctg etg 
Ala Gin Ser Gly Leu Phe Ala Leu Gin Ala Gly Leu Leu Gly Leu Leu 
2165 2170 2175 , 



6528 



ggt teg tgg ggc gtt egg ccg gat gtg gtg atg ggg cat teg gtc ggg 
Gly Ser Trp Gly Val Arg Pro Asp Val Val Met Gly His Ser Val Gly 
2180 2185 2190 



6576 



gag ttg gec gee gcg ttt gcg get ggc gtg ttg teg ttg egg gat gcg 
Glu Leu Ala Ala Ala Phe Ala Ala Gly Val Leu Ser Leu Arg Asp Ala 
2195 2200 2205 



6624 



get egg ttg gtg gee gcg cgc gec egg ttg atg caa gee ctg ccc tct 
Ala Arg Leu Val Ala Ala Arg Ala Arg Leu Met Gin Ala Leu Pro Ser 
2210 2215 2220 



6672 



gac ggc gcg atg ttg gcg gtg get get ggt gaa gac ctt gtt egg cca 
Asp Gly Ala Met Leu Ala Val Ala Ala Gly Glu Asp Leu Val Arg Pro 
2225 2230 2235 2240 



6720 



ttg ctg gee ggt egg gag gag tec gtg age gtc gee gcg etc aat gee 
Leu Leu Ala Gly Arg Glu Glu Ser Val Ser Val Ala Ala Leu Asn Ala 
2245 2250 2255 



6768 



ccc ggt teg gtg gtg ttg teg ggc gat egg gag gtg ctg gee age ate 
Pro Gly Ser Val Val Leu Ser Gly Asp Arg Glu Val Leu Ala Ser He 
2260 2265 2270 



6816 



gtc ggc egg ctg ace gag etc cga gtc egg acg egg cgc ttg egg gtc 
Val Gly Arg Leu Thr Glu Leu Arg Val Arg Thr Arg Arg Leu Arg Val 
2275 2280 2285 



6864 



tec cat get ttt cat teg cac egg atg gac ccg atg ttg ggc gag ttc 
Ser His Ala Phe His Ser His Arg Met Asp Pro Met Leu Gly Glu Phe 
2290 2295 2300 



6912 



gee cag ate gee gag tct gcg gag ttc ggt aag cca acg aca ccg ctt 
Ala Gin He Ala Glu Ser Ala Glu Phe Gly Lys Pro Thr Thr Pro Leu 
2305 2310 2315 2320 



6960 



gtg teg acg ttg acg ggt gag etc gac aga gee gcg gaa atg age aca 
Val Ser Thr Leu Thr Gly Glu Leu Asp Arg Ala Ala Glu Met Ser Thr 
2325 2330 2335 



7008 



cca ggg tat tgg gtg cgc cag gcg cgt gaa ccc gtc cgt ttc gee gac 7056 
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Pro Gly Tyr Trp Val Arg Gin Ala Arg Glu Pro Val Arg Phe Ala Asp 
2340 2345 2350 



ggt gtc cag gcc ctg gca gcg cag ggc ata ggc acg gtc gtc gag etc 
Gly Val Gin Ala Leu Ala Ala Gin Gly He Gly Thr Val Val Glu Leu 
2355 2360 2365 



7104 



ggc ccg gac gga acg ctg gcg gca ctg gtt egg gag tgt gcg ace gag 
Gly Pro Asp Gly Thr Leu Ala Ala Leu Val Arg Glu Cys Ala Thr Glu 
2370 2375 2380 



7152 



tec gat egg gtt ggg egg att teg teg ate cca ctg atg cgc agg gag 
Ser Asp Arg Val Gly Arg He Ser Ser He Pro Leu Met Arg Arg Glu 
2385 2390 2395 2400 



7200 



egg gac gag acc cgt teg gtg atg aca gcc ctg gcg cat etc cac acc 
Arg Asp Glu Thr Arg Ser Val Met Thr Ala Leu Ala His Leu His Thr 
2405 2410 2415 



7248 



cgt ggt ggt gag gtg gac tgg cag gcg ttt ttc gcc ggt acc ggc get 
Arg Gly Gly Glu Val Asp Trp Gin Ala Phe Phe Ala Gly Thr Gly Ala 
2420 2425 2430 



7296 



agg cag etc gag ttg cca acg tat gcc ttc caa cga cag cac tac tgg 
Arg Gin Leu Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin His Tyr Trp 
2435 2440 2445 



7344 



ate gag tec agt gcg egg cca gca cgc gac cgc gca gac ate ggc gag 
He Glu Ser Ser Ala Arg Pro Ala Arg Asp Arg Ala Asp He Gly Glu 
2450 2455 2460 



7392 



gtg gcg gaa cag ttc tgg acc gcg gtt gac caa ggc gat ctg gca acg 
Val Ala Glu Gin Phe Trp Thr Ala Val Asp Gin Gly Asp Leu Ala Thr 
2465 2470 2475 2480 



7440 



ttg gtc gcc get ctg gat ctt ggg gcg gac gac gac aca tgc gca teg 
Leu Val Ala Ala Leu Asp Leu Gly Ala Asp Asp Asp Thr Cys Ala Ser 
2485 2490 2495 



7488 



ttg age gat gta ttg ccg gcg ttg tec tec tgg cga age gga etc cgc 
Leu Ser Asp Val Leu Pro Ala Leu Ser Ser Trp Arg Ser Gly Leu Arg 
2500 2505 2510 



7536 



aac cgt teg etc gtc gat tec tgc egg tac cga ate agt tgg cat tec 
Asn Arg Ser Leu Val Asp Ser Cys Arg Tyr Arg He Ser Trp His Ser 
2515 2520 2525 



7584 



tct egg gag gtg ccg gcc ccg aag att tec ggt acc tgg ctg ttg gtc 7632 
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Ser Arg- Glu Val Pro Ala Pro Lys He Ser Gly Thr Trp Leu Leu Val 
2530 2535 2540 



gtg ccc ggt get gcg gat gac gga ttg gtc acg get ttg acg agt tea 
Val Pro Gly Ala Ala Asp Asp Gly Leu Val Thr Ala Leu Thr Ser Ser 
2545 2550 2555 2560 



7680 



ctg gtc gga ggc ggc gee gag gtc gtc egg ate ggc ctg tec. gaa gag 
Leu Val Gly Gly Gly Ala Glu Val Val Arg He Gly Leu Ser Glu Glu 
2565 2570 2575 



7728 



gac ccg cac cgc gag gac gtc gca cag egg ctg gee aat gcg ctg acg 
Asp Pro His Arg Glu Asp Val Ala Gin Arg Leu Ala Asn Ala Leu Thr 
2580 2585 2590 



7776 



gat gee ggt caa etc ggt ggc gtg ctt teg ctg ttg ggg etc gat gaa 
Asp Ala Gly Gin Leu Gly Gly Val Leu Ser Leu Leu Gly Leu Asp Glu 
2595 2600 2605 



7824 



teg cct get ccg gga ttc tec tgc ttg cca act ggt ttc gcg ctg act 
Ser Pro Ala Pro Gly Phe Ser Cys Leu Pro Thr Gly Phe Ala Leu Thr 
2610 2615 2620 



7872 



gtg cag ctt ctg egg gee ttg egg aag gee gac gtc gag gcg cct ttt 
Val Gin Leu Leu Arg Ala Leu Arg Lys Ala Asp Val Glu Ala Pro Phe 
2625 2630 2635 2640 



7920 



tgg gcg gtg acg cgc ggc ggc gtc gcg ttg gaa gat gta cgc gtg tct 
Trp Ala Val Thr Arg Gly Gly Val Ala Leu Glu Asp Val Arg Val Ser 
2645 2650 2655 



7968 



ccg gag cag gee ctg gtc tgg ggg ctg ctg cgt gtc gcg gga ctg gag 
Pro Glu Gin Ala Leu Val Trp Gly Leu Leu Arg Val Ala Gly Leu Glu 
2660 2665 2670 



8016 



cac ccg gag ttc tgg ggt ggc ttg ate gac ctg cca teg gac tgg gac 
His Pro Glu Phe Trp Gly Gly Leu He Asp Leu Pro Ser Asp Trp Asp 
2675 2680 2685 



8064 



gac cga ttg ggt gee egg ttg gcg ggt gtg ttg gcg gat ggt ggc gag 8112 
Asp Arg Leu Gly Ala Arg Leu Ala Gly Val Leu Ala Asp Gly Gly Glu 
2690 2695 2700 

gat caa gtc gee att cgc cgt ggt ggt gtg ttc gtg egg egg ttg gaa 8160 
Asp Gin Val Ala He Arg Arg Gly Gly Val Phe Val Arg Arg Leu Glu 
2705 2710 2715 2720 



cgc get ggt gcg teg ggt gee ggg teg gtg tgg cgt cct egg ggg acg 8208 
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Arg Ala Gly Ala Ser Gly Ala Gly Ser Val Trp Arg Pro Arg Gly Thr 
2725 2730 2735 

gtg ttg gtg acg ggt ggt acg ggc ggt ttg ggg gcg cat gtt gcc egg 8256 
Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Ala His Val Ala Arg 
2740 2745 2750 

tgg ttg gcc ggt gcc ggg get gag cac gtg gtg ttg acc age cgt cga 83 04 
Trp Leu Ala Gly Ala Gly Ala Glu His Val Val Leu Thr Ser Arg Arg 
2755 * 2760 2765 

gga gcg gac get ccg ggc get ggg gaa ttg egg gcg gag ctg gag gcg 8352 
Gly Ala Asp Ala Pro Gly Ala Gly Glu Leu Arg Ala Glu Leu Glu Ala 
2770 2775 2780 



ctg ggt get egg gtg teg att gtg ccc tgc gac gtg get gat cgt gac 
Leu Gly Ala Arg Val Ser lie Val Pro Cys Asp Val Ala Asp Arg Asp 
2785 2790 2795 2800 



8400 



gca gtg get gga gtg ttg gca ggg ate ggt ggg gag tgt ccg ctg act 
Ala Val Ala Gly Val Leu Ala Gly He Gly Gly Glu Cys Pro Leu Thr 
2805 2810 2815 



8448 



gcg gtg gta cac gcc gcc ggg gtc ggc gag gcg ggc gac gta gtg gag 
Ala Val Val . His Ala Ala Gly Val Gly Glu Ala Gly Asp Val Val Glu 
2820 2825 2830 



8496 



atg ggt ttg gcg gat ttt gca gcg gtg ttg teg gcg aag gtg cgt ggt 
Met Gly Leu Ala Asp Phe Ala Ala Val Leu Ser Ala Lys Val Arg Gly 
2835 2840 2845 



8544 



gcg gcg aat ctg gac gag ttg ctg gcc gac teg gag ctg gat gcg ttt 
Ala Ala Asn Leu Asp Glu Leu Leu Ala Asp Ser Glu Leu Asp Ala Phe 
2850 2855 2860 



8592 



gtg atg ttc tec teg gtg teg ggg gtg tgg gga gcc ggc gga cag ggt 
Val Met Phe Ser Ser Val Ser Gly Val Trp Gly Ala Gly Gly Gin Gly 
2865 2870 2875 2880 



8640 



gcg tat gcg get gcg aac gcc tac ttg gat gcg ttg gcc gag cag cgt 
Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Gin Arg 
2885 2890 2895 



8688 



egg gcg agg gga ttg gtc ggg acc gcg gtt gcg tgg gga ccg tgg gcc 
Arg Ala Arg Gly Leu Val Gly Thr Ala Val Ala Trp Gly Pro Trp Ala 
2900 2905 2910 



8736 



ggt gac ggc atg gcc gcc ggc gaa acc ggc gca cag ctg cac egg atg 8784 
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Gly Asp Gly Met Ala Ala .Gly Glu Thr Gly Ala Gin Leu His Arg Met 
2915 2920 2925 

ggc ctg gcg teg atg gaa ccg age gcg gcg ctg ctg gca ctt cag ggt 8832 
Gly Leu Ala Ser Met Glu Pro Ser Ala Ala Leu Leu Ala Leu Gin Gly 
2930 2935 2940 

gca ttg gac cgc gat gag ace tec etc gtc gtg gee gat gtc gat tgg 8880 
Ala Leu Asp Arg Asp Glu Thr Ser Leu Val Val Ala Asp Val Asp Trp 
2945 2950 2955 2960 

gca egg ttc gee cca gee ttc ace teg gca cgt cga cgc ccg ctg ctg 8928 
Ala Arg Phe Ala Pro Ala Phe Thr Ser Ala Arg Arg Arg Pro Leu Leu 
2965 2970 2975 

gac ace ate gac gag gec cga gec gca ttg gaa acc ace ggc gaa caa 8976 
Asp Thr lie Asp Glu Ala Arg Ala Ala Leu Glu Thr Thr Gly Glu Gin 
2980 2985 2990 

gcg ggc aca ggc aaa ccc gtt gag ctg acg caa cgc ctg gee gga ctg 9024 
Ala Gly Thr Gly Lys Pro Val Glu Leu Thr Gin Arg Leu Ala Gly Leu 
2995 3000 3005 

teg egg aag gaa cgc gac gat gcg gta ttg gat ctg gtg egg gcg gag 9072 
Ser Arg Lys Glu Arg Asp Asp Ala Val Leu Asp Leu Val Arg Ala Glu 
3010 3015 3020 

acg gcg get gtg ctg gga cgc gac gat gec acg gee ctg gcg cca teg 9120 
Thr Ala Ala Val .Leu Gly Arg Asp Asp Ala Thr Ala Leu Ala Pro Ser 
3025 3030 3035 3040 

egg ccg ttc cag gaa etc gga ttc gac tec ttg atg gcg gtg gag ctg 9168 
Arg Pro Phe Gin Glu Leu Gly Phe Asp Ser Leu Met Ala Val Glu Leu 
3045 3050 3055 

cgc aac egg ctg aac acc gee acc ggg ate cag ctg ccc gee age acg 9216 
Arg Asn Arg Leu Asn Thr Ala Thr Gly lie Gin Leu Pro Ala Ser Thr 
3060 3065 3070 

att ttc gac tac ccc aat gee gag teg ctg teg cgt cac etc tgc gec 9264 
lie Phe Asp Tyr Pro Asn Ala Glu Ser Leu Ser Arg His Leu Cys Ala 
3075 3080 3085 

gag ctt ttc cca acg gag act acc gtg gac teg gec ctt gee gag etc 9312 
Glu Leu Phe Pro Thr Glu Thr Thr Val Asp Ser Ala Leu Ala Glu Leu 
3090 3095 3100 

gat cga ate gag cag cag etc teg atg etc acc ggc gaa gcg egg gca 9360 



212 



WO 01/16303 



PCT/EPOO/08013 



Asp Arg lie Glu Gin Gin Leu Ser Met Leu Thr Gly Glu Ala Arg Ala 
3105 3110 3115 3120 

c 99 9 a c cga ate gcg aca cga ctg cga gec etc cac gag aag tgg aac 9408 
Arg Asp Arg He Ala Thr Arg Leu Arg Ala Leu His Glu Lys Trp Asn 
3125 3130 3135 

age gca get gaa gta ccg ace gga gee gat gtc ctg age acg etc gat 9456 
Ser Ala Ala Glu Val Pro Thr Gly Ala Asp Val Leu Ser Thr Leu Asp 
3140 3145 3150 

teg gcg acg cac gac gag ata ttc gag ttc ate gac aac gag etc gac 9504 
Ser Ala Thr His Asp Glu He Phe Glu Phe He Asp Asn Glu Leu Asp 
3155 3160 3165 

ctg tec tga 9513 
Leu Ser 
3170 



<210> 46 
<211> 3170 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 46 

Met Ser Asn Glu Glu Lys Leu Arg Glu Tyr Leu Arg Arg Ala Leu Val 
15 10 15 

Asp Leu His Gin Ala Arg Glu Arg Leu His Glu Ala Glu Ser Gly Glu 
20 25 30 

Arg Glu Pro He Ala lie Val Ala Met Gly Cys Arg Tyr Pro Gly Gly 
35 40 45 

Val Gin Asp Pro Glu Gly Leu Trp Lys Leu Val Ala Ser Gly Gly Asp 
50 55 60 

Ala He Gly Glu Phe Pro Ala Asp Arg Gly Trp His Leu Asp GlM Leu 
65 70 75 80 

Tyr Asp Pro Asp Pro Asp Gin Pro Gly Thr Cys Tyr Thr Arg His Gly 
85 90 95 

Gly Phe Leu His Asp Ala Gly Glu Phe Asp Ala Gly Phe Phe Asp He 
100 105 110 

Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
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115 120 125 

Glu He Ser Trp Glu Thr Val Glu Ser Ala Gly Met Asp Pro Arg Ser 
130 135 140 

Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met Tyr Glu Gly 
145 150 155 160 



Tyr Asp Thr Gly Ala His Arg Ala Gly Glu Gly Val Glu Gly Tyr Leu 
165 170 175 

Gly Thr Gly Asn Ala Gly Ser Val Ala Ser Gly Arg Val Ala Tyr Ala 
180 185 190 

Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser 
195 200 205 

Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Gin Gly Glu 
210 215 220 

Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ser Thr Pro Glu 
225 230 235 240 

Arg Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 250 255 

Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 

Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 
275 280 285 

His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly 
290. 295 300 

Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Leu Ala Gin Glu Arg Val 
305 310 315 320 

He Gin Gin Val Leu Thr Ser Ala Gly Leu Ser Ala Ser Asp Va'l Asp 
325 330 335 



Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu 
340 345 350 

Ala Gin Ala Leu He Ala Ala Tyr Gly Gin Asp Arg Asp Arg Asp Arg 
355 360 365 

Pro Leu Trp Leu Gly Ser Val Lys Ser Asn He Gly His Thr Gin Ala 
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370 375 380 

Ala Ala Gly Val Ala Gly Val He Lys Met Val Met Ala Met Arg His 
385 390 395 400 

Gly Glu Leu Pro Arg Thr Leu His Val Asp Glu Pro Asn Ser His Val 
405 410 415 

Asp Trp Ser Ala Gly Ala Val Arg Leu Leu Thr Glu Asn He Arg Trp 
420 425 430 

Pro Gly Thr Gly Thr Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser 
435 440 445 

Gly Thr Asn Ala His Val He Leu Glu His Asp Pro Leu Ala Val Thr 
450 455 460 

Glu Asn Glu Glu Ala Ala Gin Ser Pro Ala Pro Gly He Val Pro Trp 
465 - 470 475 480 

Ala Leu Ser Gly Arg Ser Ser Thr Ala Leu Arg Ala Gin Ala Glu Arg 
485 490 495 

Leu Arg Glu Leu Cys Glu Gin Thr Asp Pro Asp Pro Val Asp Val Gly 
500 505 510 

Phe Ser Leu Ala Ala Thr Arg Thr Ala Trp Glu His Arg Ala Val Val 
515 520 525 

Leu Gly Arg Asp Ser Ala Thr Leu Arg Ser Gly Leu Gly Val Val Ala 
530 535 540 

Ser Gly Glu Pro Ala Val Asp Val Val Glu Gly Ser Val Leu Asp Gly 
545 550 555 560 

Glu Val Val Phe Val Phe Pro Gly Gin Gly Trp Gin Trp Ala Gly Met 
565 570 575 

Ala Val Asp Leu Leu Asp Ala Ser Pro Thr Phe Ala Arg His Met Asp 
580 585 590 

Glu Cys Ala Thr Ala Leu Arg Arg Tyr Val Asp Trp Ser Leu Val Asp 
595 600 605 

Val Leu Arg Gly Ala Glu Asn Ser Pro Pro Leu Asp Arg Val Asp Val 
610 615 620 

Leu Gin Pro Ala Ser Phe Ala Val Met Val Ser Leu Ala Glu Val Trp 
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625 



630 



635 



640 



Arg Ser Tyr Gly Val Arg Pro Ala Ala Val Val Gly His Ser Gin Gly 
645 . 650 655 

Glu He Ala Ala Ala Cys Ala Ala Gly Val Leu Pro Leu Glu Asp Ala 
660 665 670 

Ala Arg Leu Val Ala Leu Arg Ser Arg Ala Leu Lys Gly Leu Ser Gly 
675 680 685 

Arg Gly Gly Met Ala Ser Leu Ala Cys Pro Ala Asp Glu Val Ala Ala 
690 695 700 

Leu Phe Ala Gly Ser Gly Gly Arg Leu Glu Val Ala Ala He Asn Gly 
705 710 715 720 

Pro Arg Ser Val Val Val Ser Gly Asp Leu Glu Ala Val Asp Glu Leu 
725 ^ 730 735 

Leu Ala Glu Cys Ala Glu Lys Asp Met Arg Ala Arg Arg He Pro Val 
740 745 750 

Asp Tyr Ala Ser His Ser Ala His Val Glu Val Val Arg Ser Pro Val 
755 760 765 

Leu Ala Ala Ala Ala Gly Val Arg His Arg Asp Gly Gin Val Pro Trp 
770 775 780 

Trp Ser Thr Val He Gly Asp Trp Val Asp Pro Ala Arg Leu Asp Gly 
785 790 795 " 800 

Glu Tyr Trp Tyr Arg Asn Leu Arg Gin Pro Val Arg Phe Glu His Ala 
805 810 815 

Val Gin Gly Leu Val Glu Arg Gly Phe Gly Leu Phe He Glu Met Ser 
820 825 830 

Ala His Pro Val Leu Thr Thr Ala Val Glu Glu Thr Gly Ala Glut Ser 
835 840 845 

Glu Thr Ala Val Ala Ala Val Gly Thr Leu Arg Arg Asp Ser Gly Gly 
850 855 860 

Leu Arg Arg Leu Leu His Ser Leu Ala Glu Ala Tyr Val Arg Gly Ala 
865 870 875 880 



Thr Val Asp Trp Ala Val Ala Phe Gly Gly Ala Gly Arg Arg Leu Asp 
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885 890 895 

Leu Pro Thr Tyr Pro Phe Gin Arg Gin Arg Tyr Trp Leu Asp Lys Gly 
900 905 910 

Ala Ala Ser Asp Glu Ala Arg Ala Val Ser Asp Pro Ala Ala Gly Trp 
915 920 925 

Phe Trp Gin Ala Val Ala Arg Gin Asp Leu Lys Ser Val Ser Asp Ala 
930 935 940 

Leu Asp Leu Asp Ala Asp Ala Pro Leu Ser Ala Thr Leu Pro Ala Leu 
945 950 955 960 

Ser Val Trp His Arg Gin Glu Arg Glu Arg Val Leu Ala Asp Gly Trp 
965 970 975 

Arg Tyr Arg Val Asp Trp Val Arg Val Ala Pro Gin Pro Val Arg Arg 
980 985 990 

Thr Arg Glu Thr Trp Leu Leu Val Val Pro Pro Gly Gly lie Glu Glu 
995 1000 1005 

Ala Leu Val Glu Arg Leu Thr Asp Ala Leu Asn Thr Arg Gly lie Ser 
1010 1015 1020 

Thr Leu Arg Leu Asp Val Pro Pro Ala Ala Thr Ser Gly Glu Leu Ala 
025 1030 1035 1040 

Thr Glu Leu Arg Ala Ala Ala Asp Gly Asp Pro Val Lys Ala lie Leu 
1045 1050 1055 

Ser Leu Thr Ala Leu Asp Glu Arg Pro His Pro Glu Cys Lys Asp Val 
1060 1065 1070 

Pro Ser Gly He Ala Leu Leu Leu Asn Leu Val Lys Ala Leu Gly Glu 
1075 1080 1085 

Ala Asp Leu Arg He Pro Leu Trp Thr He Thr Arg. Gly Ala Va'l Lys 
1090 1095 1100 

Ala Gly Pro Ala Asp Arg Leu Leu Arg Pro Met Gin Ala Gin Ala Trp 
105 1110 1115 1120 

Gly Leu Gly Arg Val Ala Ala Leu Glu His Pro Glu Arg Trp Gly Gly 
1125 1130 1135 

Leu He Asp Leu Pro Asp Ser Leu Asp Gly Asp Val Leu Thr Arg Leu 
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1140 1145 1150 

Gly Glu Ala Leu Thr Asn Gly Leu Ala Glu Asp Gin Leu Ala He Arg 
1155 . 1160 1165 

Gin Ser Gly Val Leu Ala Arg Arg Leu Val Pro Ala Pro Ala Asn Gin 
1170 1175 1180 

Pro Ala Gly Arg Lys Trp Arg Pro Arg Gly Ser Ala Leu He Thr Gly 
185. 1190 1195 1200 

Gly Leu Gly Ala Val Gly Ala Gin Val Ala Arg Trp Leu Ala Glu He 
1205 1210 1215 

Gly Ala Glu Arg He Val Leu Thr Ser Arg Arg Gly Asn Gin Ala Ala 
1220 1225 1230 

Gly Ala Ala Glu Leu Glu Ala Glu Leu Arg Ala Leu Gly Ala Gin Val 
1235 1240 1245 

Ser He Val Ala Cys Asp Val Thr Asp Arg Ala Glu Met Ser Ala Leu 
1250 1255 1260 

Leu Ala Glu Phe Asp Val Thr Ala Val Phe His Ala Ala Gly Val Gly 
265 1270 1275 1280 

Arg Leu Leu Pro Leu Ala Glu Thr Asp Gin Asn Gly Leu Ala Glu He 
1285 1290 1295 

Cys Ala Ala Lys Val Arg Gly Ala Gin Val Leu Asp Glu Leu Cys Asp 
1300 1305 1310 

Ser Thr Asp Leu Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Gly Val 
1315 1320 1325 

Trp Gly Gly Gly Gly Gin Gly Ala Tyr Gly Ala Ala Asn Ala Phe Leu 
1330 1335 1340 

Asp Thr Leu Ala Glu Gin Arg Arg Ala Arg Gly Leu Pro Ala Thr Ser 
345 1350 1355 1360 

He Ser Trp Gly Ser Trp Ala Gly Gly Gly Met Ala Asp Gly Ala Ala 
1365 1370 1375 

Gly Glu His Leu Arg Arg Arg Gly lie Arg Pro Met Pro Ala Ala Ser 
1380 1385 1390 

Ala He Leu Ala Leu Gin Glu Val Leu Asp Gin Asp Glu Thr Cys Val 
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1395 1400 1405 

Ser lie Ala Asp Val Asp Trp Asp Arg Phe Val Pro Thr Phe Ala Ala 
1410 1415 1420 

Thr Arg Ala Thr Arg Leu Phe Asp Glu Val Pro Ala Ala Arg Lys Ala 
425 1430 1435 1440 

Met Pro Ala Asn Gly Pro Ala Glu Pro Gly Gly Ser Pro Phe Ala Arg 
1445 1450 1455 

Asn Leu Ala Glu Leu Pro Glu Ala Gin Arg Arg His Glu Leu Val Asp 
1460 1465 1470 

Leu Val Cys Ala Gin Val Ala Thr Val Leu Gly His Gly Ser Arg Glu 
1475 1480 1485 

Glu Val Gin Pro Glu Arg Ala Phe Arg Ala Leu Gly Phe Asp Ser Leu 
1490 1495 1500 

Met Ala Val Asp Leu Arg Asn Arg Leu Thr Thr Ala Thr Gly Leu Arg 
505 1510 1515 1520 

Leu Pro Thr Thr Thr Val Phe Asp Tyr Pro Asn Pro Ala Ala Leu Ala 
1525 1530 1535 

Ala His Leu Leu Glu Glu Leu Val Gly Asp Val Ala Ser Ala Ala Val 
1540 1545 1550 

Thr Ala Ala Ser Ala Pro Ala Ser Asp Glu Pro He Ala He Val Ala 
1555 1560 1565 

Met Ser Cys Arg Phe Pro Gly Gly Ala His Ser Pro Glu Asp Leu Trp 
1570 1575 1580 

Arg Leu Val Ala Ala Gly Thr Glu Val He Gly Glu Phe Pro Ser Asp 
585 1590 1595 1600 

Arg Gly Trp Asp Ala Glu Gly Leu Tyr Asp Pro Asp Ala Ser Arg Pro 
1605 1610 1615 

Gly Thr Thr Tyr Ala Arg Met Ala Gly Phe Leu Tyr Asp Ala Gly Glu 
1620 1625 1630 

Phe Asp Ala Asp Leu Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met 
1635 1640 1645 

Asp Pro Gin Gin Arg Leu Val Leu Glu He Ala Trp Glu Ala Leu Glu 
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1650 1655 1660 

Arg Ala Gly lie Asp Pro Leu Ser Leu Lys Gly Ser Gly Val Gly Thr 
665 1670 1675 1680 

Tyr He Gly Ala Gly Ser Arg Gly Tyr Ala Thr Asp Val Arg Gin Phe 
1685 1690 1695 

Pro Glu Glu Ala Glu Gly Tyr Leu Leu Thr Gly Thr Ser Ala Ser Val 
1700 t 1705 1710 



Leu Ser Gly Arg Val Ala Tyr Ser 
1715 1720 

Thr Val Asp Thr Ala Cys Ser Ser 
1730 1735 

Cys Gin Ser Leu Arg Ser Gly Glu 
745 1750 

Val Thr Val Met Ser Thr Pro Glu 
1765 



Phe Gly Phe Glu Gly Pro Ala Val 
1725 

Ser Leu Val Ala Leu His Leu Ala 
1740 

Cys Asp Leu Ala Leu Ala Gly Gly 
1755 1760 

Met Phe Val Glu Phe Ser Arg Gin 
1770 1775 



Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ser Ala 
1780 1785 1790 

Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg 
1795 1800 1805 

Leu Ser Asp Ala His Arg Asn Gly His Arg Val Leu Ala Val Val Arg 
1810 1815 1820 

Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Ala Ala Pro 
825 1830 1835 1840 

Asn Gly Pro Ser Gin Gin Arg Val lie Asn Gin Ala Leu Ala Asn Ala 
1845 1850 1855 

Ala Leu Ser Ala Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly 
1860 1865 1870 

Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Leu He Ala Thr Tyr 
1875 1880 1885 

Gly Gin Ala Arg Glu Arg Asp Arg Pro Leu Trp Leu Gly Ser Val Lys 
1890 1895 1900 

Ser Asn He Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He 
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905 1910 1915 1920 

Lys Met Val Met Ala Met Arg His Gly Gin Leu Pro Ala Ser Leu His 
1925 1930 1935 

Ala Asp Glu Pro Thr Ser Glu Val Asp Trp Ser Ser Gly Ala Val Arg 
1940 1945 1950 

Leu Leu Ala Glu Gin Val Pro Trp Pro Glu Ser Asp Arg Val Arg Arg 
1955 1960 1965 

Val Gly Val Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Val He 
1970 1975 1980 

Leu Glu Gin Ala Thr Asn Ala Pro Asp Ser Thr Ala Glu Thr Asp Lys 
985 1990 1995 2000 

Thr Glu Ser Gly Ser Thr Val Asp He Pro Val Val Pro Trp Leu Val 
2005 2010 2015 

Ser Gly Lys Thr Thr Asp Ser Leu Arg Gly Gin Ala Glu Arg Val Leu 
2020 2025 2030 

Ser Gin Val Glu Ser Arg Pro Glu Gin Arg Ser Leu Asp Val Ala Tyr 
2035 2040 2045 

Ser Leu Ala Ser Gly Arg Ala Ala Leu Asp Glu Arg Ala Val Val Leu 
2050 2055 2060 

Gly Ala Asp Arg Gly Glu Leu Val Ala Gly Leu Ala Ala Leu Ala Ala 
065 2070 2075 2080 

Gly Gin Glu Ala Ser Gly Val He Ser Gly Thr Arg Ala Ser Ala Arg 
2085 2090 2095 

Phe Gly Phe Val Phe Ser Gly Gin Gly Gly Gin Trp Leu Gly Met Gly 
2100 2105 2110 

Arg Ala Leu Tyr Ser Lys Phe Pro Val Phe Ala Ala Ala Phe Asp Glu 
2115 2120 2125 

Ala Cys Ala Glu Leu Glu Ala His Leu Gly Glu Asp Arg Arg Val Arg 
2130 2135 2140 

Asp Val Val Phe Gly Ser Asp Ala Gin Leu Leu Asp Gin Thr Leu Trp 
145 2150 2155 2160 

Ala Gin Ser Gly Leu Phe Ala Leu Gin Ala Gly Leu Leu Gly Leu Leu 
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2165 2170 2175 

Gly Ser Trp Gly Val Arg Pro Asp Val Val Met Gly His Ser Val Gly 
2180 2185 2190 

Glu Leu Ala Ala Ala Phe Ala Ala Gly Val Leu Ser Leu Arg Asp Ala 
2195 2200 2205 

Ala Arg Leu Val Ala Ala Arg Ala Arg Leu Met Gin Ala Leu Pro Ser 
2210 2215 2220 

Asp Gly Ala Met Leu Ala Val Ala Ala Gly Glu Asp Leu Val Arg Pro 
225 2230 2235 2240 

Leu Leu Ala. Gly Arg Glu Glu Ser Val Ser Val Ala Ala Leu Asn Ala 
2245 2250 2255 

Pro Gly Ser Val Val Leu Ser Gly Asp Arg Glu Val Leu Ala Ser He 
2260 2265 2270 

Val Gly Arg Leu Thr Glu Leu Arg Val Arg Thr Arg Arg Leu Arg Val 
2275 2280 2285 

Ser His Ala Phe His Ser His Arg Met Asp Pro Met Leu Gly Glu Phe 
2290 2295 2300 

Ala Gin He Ala Glu Ser Ala Glu Phe Gly Lys Pro Thr Thr Pro Leu 
305 2310 2315 2320 

Val Ser Thr Leu Thr Gly Glu Leu Asp Arg Ala Ala Glu Met Ser Thr 
2325 2330 2335 

Pro Gly Tyr Trp Val Arg Gin Ala Arg Glu Pro Val . Arg Phe Ala Asp 
2340 2345 2350 

Gly Val Gin Ala Leu Ala Ala Gin Gly He Gly Thr Val Val Glu Leu 
2355 2360 2365 

Gly Pro Asp Gly Thr Leu Ala Ala Leu Val Arg Glu Cys Ala Thr Glu 
2370 2375 2380 

Ser Asp Arg Val Gly Arg He Ser Ser He Pro Leu Met Arg Arg Glu 
385 2390 2395 2400 

Arg Asp Glu Thr Arg Ser Val Met Thr Ala Leu Ala His Leu His Thr 
2405 2410 2415 

Arg Gly Gly Glu Val Asp Trp Gin Ala Phe Phe Ala Gly Thr Gly Ala 
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2420 2425 2430 

Arg Gin Leu Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin His Tyr Trp 
2435 2440 2445 

lie Glu Ser Ser Ala Arg Pro Ala Arg Asp Arg Ala Asp lie Gly Glu 
2450 2455 2460 

Val Ala Glu Gin Phe Trp Thr Ala Val Asp Gin Gly Asp Leu Ala Thr 
465 2470 2475 2480 

Leu Val "Ala Ala Leu Asp Leu Gly Ala Asp Asp Asp Thr Cys Ala Ser 
2485 2490 2495 

Leu Ser Asp Val Leu Pro Ala Leu Ser Ser Trp Arg Ser Gly Leu Arg 
2500 2505 2510 

Asn Arg Ser Leu Val Asp Ser Cys Arg Tyr Arg He Ser Trp His Ser 
2515 2520 2525 

Ser Arg Glu Val Pro Ala Pro Lys He Ser Gly Thr Trp Leu Leu Val 
2530 2535 2540 

Val Pro Gly Ala Ala Asp Asp Gly Leu Val Thr Ala Leu Thr Ser Ser 
545 2550 2555 2560 

Leu Val Gly Gly Gly Ala Glu Val Val Arg He Gly Leu Ser Glu Glu 
2565 2570 2575 

Asp Pro His Arg Glu Asp Val Ala Gin Arg Leu Ala Asn Ala Leu Thr 
2580 2585 2590 

Asp Ala Gly Gin Leu Gly Gly Val Leu Ser Leu Leu Gly Leu Asp Glu 
2595 2600 2605 

Ser Pro Ala Pro Gly Phe Ser Cys Leu Pro Thr Gly Phe Ala Leu Thr 
2610 2615 2620 

Val Gin Leu Leu Arg Ala Leu Arg Lys Ala Asp Val Glu Ala Pro Phe 
625 2630 2635 2640 

Trp Ala Val Thr Arg Gly Gly Val Ala Leu Glu Asp Val Arg Val Ser 
2645 2650 2655 

Pro Glu Gin Ala Leu Val Trp Gly Leu Leu Arg Val Ala Gly Leu Glu 
2660 2665 2670 

His Pro Glu Phe Trp Gly Gly Leu He Asp Leu Pro Ser Asp Trp Asp 
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2675 2680 2685 

Asp Arg Leu Gly Ala Arg Leu Ala Gly Val Leu Ala Asp Gly Gly Glu 
2690 2695 2700 

Asp Gin Val Ala He Arg Arg Gly Gly Val Phe Val Arg Arg Leu Glu 
705 2710 2715 2720 

Arg Ala Gly Ala Ser Gly Ala Gly Ser Val Trp Arg Pro Arg Gly Thr 
2725 2730 2735 

Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Ala His Val Ala Arg 
2740 2745 2750 

Trp Leu Ala Gly Ala Gly Ala Glu His Val Val Leu Thr Ser Arg Arg 
2755 2760 2765 

Gly Ala Asp Ala Pro Gly Ala Gly Glu Leu Arg Ala Glu Leu Glu Ala 
2770 2775 2780 

Leu Gly Ala Arg Val Ser He Val Pro Cys Asp Val Ala Asp Arg Asp 
785 2790 2795 2800 

Ala Val Ala Gly Val Leu Ala Gly He Gly Gly Glu Cys Pro Leu Thr 
2805 2810 2815 

Ala Val Val His Ala Ala Gly Val Gly Glu Ala Gly Asp Val Val Glu 
2820 2825 2830 

Met Gly Leu Ala Asp Phe Ala Ala Val Leu Ser Ala Lys Val Arg Gly 
2835 2840 2845 

Ala Ala Asn Leu Asp Glu Leu Leu Ala Asp Ser Glu Leu Asp Ala Phe 
2850 2855 2860 

Val Met Phe Ser Ser Val Ser Gly Val Trp Gly Ala Gly Gly Gin Gly 
865 2870 2875 2880 

Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Gin Arg 
2885 2890 2895 

Arg Ala Arg Gly Leu Val Gly Thr Ala Val Ala Trp Gly Pro Trp Ala 
2900 2905 2910 

Gly Asp Gly Met Ala Ala Gly Glu Thr Gly Ala Gin Leu His Arg Met 
2915 2920 2925 

Gly Leu Ala Ser Met Glu Pro Ser Ala Ala Leu Leu Ala Leu Gin Gly 
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2930 2935 2940 

Ala Leu Asp Arg Asp Glu Thr Ser Leu Val Val Ala Asp Val Asp Trp 
945 2950 2955 2960 

Ala Arg Phe Ala Pro Ala Phe Thr Ser Ala Arg Arg Arg Pro Leu Leu 
2965 2970 2975 

Asp Thr lie Asp Glu Ala Arg Ala Ala Leu Glu Thr Thr Gly Glu Gin 
2980 2985 2990 

Ala Gly Thr Gly Lys Pro Val Glu Leu Thr Gin Arg Leu Ala Gly Leu 
2995 3000 3005 

Ser Arg Lys Glu Arg Asp Asp Ala Val Leu Asp Leu- Val Arg Ala Glu 
3010 3015 3020 

Thr Ala Ala Val Leu Gly Arg Asp Asp Ala Thr Ala Leu Ala Pro Ser 
025 3030 3035 3040 

Arg Pro Phe Gin Glu Leu Gly Phe Asp Ser Leu Met Ala Val Glu Leu 
3045 3050 3055 

Arg Asn Arg Leu Asn Thr Ala Thr Gly lie Gin Leu Pro Ala Ser Thr 
3060 3065 3070 

lie Phe Asp Tyr Pro Asn Ala Glu Ser Leu Ser Arg His Leu Cys Ala 
3075 3080 3085 

Glu Leu Phe Pro Thr Glu Thr Thr Val Asp Ser Ala Leu Ala Glu Leu 
3090 3095 3100 

Asp Arg lie Glu Gin Gin Leu Ser Met Leu Thr Gly Glu Ala Arg Ala 
105 3110 3115 3120 

Arg Asp Arg lie Ala Thr Arg Leu Arg Ala Leu His Glu Lys Trp Asn 
3125 3130 3135 

Ser Ala Ala Glu Val Pro Thr Gly Ala Asp Val Leu Ser Thr Leu Asp 
3140 3145 3150 

Ser Ala Thr His Asp Glu lie Phe Glu Phe lie Asp Asn Glu Leu Asp 
3155 3160 3165 

Leu Ser 
3170 
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<210> 47 
<211> 14775 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1).. (14772) 

<223> ORF21; Polyketidsynthase 

<400> 47 

atg gcc aat gaa gaa aag etc ttc ggc tat ctg aag aag gta act gcg 4 8 
Met Ala Asn Glu Glu Lys Leu Phe Gly Tyr Leu Lys Lys Val Thr Ala 
15 10 15 

gac ctg cat cag acc egg cag cgc ctg etc gcg gcc gag age egg agt 96 
Asp Leu His Gin Thr Arg Gin Arg Leu Leu Ala Ala Glu Ser Arg Ser 
20 25 30 

cag gag ccg ate gcg ate gtc teg gcg age tgc cga ctg ccc ggc ggc 144 
Gin Glu Pro He Ala He Val Ser Ala Ser Cys Arg Leu Pro Gly Gly 
35 40 45 

gtc gac tct ccc gaa gcg etc tgg caa etc gtg cgc act ggc acc gac 192 
Val Asp Ser Pro Glu Ala Leu Trp Gin Leu Val Arg Thr Gly Thr Asp 
50 55 60 

gcc ate teg gag ttc ccc gcc gac egg ggc tgg gat etc ggc egg ttg 240 
Ala He Ser Glu Phe Pro Ala Asp Arg Gly Trp Asp Leu Gly Arg Leu 
65 70 75 80 

tac gat ccc gac ccg aac cac cag gga acg teg tac acg egg gcc ggc 288 
Tyr Asp Pro Asp Pro Asn His Gin Gly Thr Ser Tyr Thr Arg Ala Gly 
85 90 95 

ggt ttc etc gca gga gcg ggc gat ttc gac ccc gcc atg ttc ggg att 336 
Gly Phe Leu Ala Gly Ala Gly Asp Phe Asp Pro Ala Met Phe Gly He 
100 105 110 

teg ccg cgt gag gcg ttg gcg atg gac ccg cag caa egg ttg ttg ctg 3 84 
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

gag ctg tec tgg gag gcc etc gaa egg gcg ggc ata gac ccg aca tec 432 
Glu Leu Ser Trp Glu Ala Leu Glu Arg Ala Gly He Asp Pro Thr Ser 
130 135 140 
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ctg cgc ggc age aag acc ggt gtc ttc ggt ggt gtc acg ccc cag gag 480 
Leu Arg Gly Ser Lys Thr Gly Val Phe Gly Gly Val Thr Pro Gin Glu 
145 150 155 160 

tac ggg ccg tec ttg cag gag atg age cga aac get ggg ggt ttt gga 52 8 
Tyr Gly Pro Ser Leu Gin Glu Met Ser Arg Asn Ala Gly Gly Phe Gly 
165 170 175 

etc acc ggg egg atg gtg agt gtg gcg teg ggt egg gtt gcg tat teg 576 
Leu Thr Gly Arg Met Val Ser Val Ala Ser Gly Arg Val Ala Tyr Ser 
180 185 190 

ttt ggt ttt gag ggt cct gcg gtg acg gtg gat acg gcg tgt teg teg 624 
Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser 
195 200 205 

teg ttg gtg gee ctg cat ttg gcg tgt cag teg ttg cgt tec ggc gaa 672 
Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Ser Gly Glu 
210 215 220 

tgc gat etc gcg ctg gee ggc ggt gtg acg gtg atg gcg aca ccg gcg 720 
Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Ala 
225 230 235 240 

acg ttc gtg gag ttc tec cgt cag cgt ggt ttg get ccg gac ggg egg 768 
Thr Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 250 255 

tgc aag teg ttc gcg get gee gcg gat ggc acc ggg tgg ggt gag ggt 816 
Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 

gec ggt ctg gtg ttg ctg gag egg ttg teg gat gcg egg egg aat ggg 864 
Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 
275 280 285 

cac gag gtt ctg gcg gtg gtg egg ggt age gcg gtg aac cag gac ggc 912 
His Glu Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly 
290 295 300 

gcg teg aat ggt ttg act gcg ccg aat ggt ccg teg cag cag egg gtg 960 
Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg Val 
305 310 315 320 

ate acc cag gcg ttg gcg agt gcg ggg ctg teg gtt tec gat gtg gat 1008 
lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser Val Ser Asp Val Asp 
325 330 335 
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gcg gtc gag gca cat ggg acc ggg acc acg ttg ggt gat ccg ate gag 1056 
Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro He Glu 
340 345 350 

gca cag gec ctg ate gee acg tac ggg cag ggc egg gag aag gat egg 1104 
Ala Gin Ala Leu He Ala Thr Tyr Gly Gin Gly Arg Glu Lys Asp Arg 
355 360 365 

ccg ttg tgg ttg ggg teg gtc aag tec aac ate ggt cac acg cag gcg 1152 
Pro Leu Trp Leu Gly Ser Val Lys Ser Asn He Gly His Thr Gin Ala 
370 375 380 

gee get ggc gtt gee ggc gtc ate aag atg gtc ttg gcg atg egg cac 1200 
Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Met Arg His 
385 390 395 400 

ggg cag ctg ccc gee acg ttg cat gtg gat gag ccc acg teg gcg gtg 1248 
Gly Gin Leu Pro Ala Thr Leu His Val Asp Glu Pro Thr Ser Ala Val 
405 410 415 

gac tgg teg gcg ggt teg gtc egg ctt etc acg gag aac acg ccc tgg 12 96 
Asp Trp Ser Ala Gly Ser Val Arg Leu Leu Thr Glu Asn Thr Pro Trp 
420 425 430 

ccg gac agt ggt cgt cct tgc egg gtg ggg gtg teg teg ttc ggg ate 1344 
Pro Asp Ser Gly Arg Pro Cys Arg Val Gly Val Ser Ser Phe Gly He 
435 440 445 

age ggc acc aac gca cat gtg att etc gaa cag tct cca gtc gag cag 13 92 
Ser Gly Thr Asn Ala His Val He Leu Glu Gin Ser Pro Val Glu Gin 
450 455 460 

ggc gaa ccg gee ggg ccg gtc gaa ggc gag egg gaa ccg gat gta gee 144 0 
Gly Glu Pro Ala Gly Pro Val Glu Gly Glu Arg Glu Pro Asp Val Ala 
465 470 475 480 

gtc ccc gtg gtg cct tgg gtg ctg teg ggt aag aca ccg gag get gcg 1488 
Val Pro Val Val Pro Trp Val Leu Ser Gly Lys Thr Pro Glu Ala Ala 
485 490 4*5 

egg gcg cag gee gaa egg gtg cat teg cat ate gag gac egg ccg ggg 153 6 
Arg Ala Gin Ala Glu Arg Val His Ser His He Glu Asp Arg Pro Gly 
500 505 510 

ctg teg ccg gtg gat gtg gcg tat teg eta gga atg aca cgc gcg gcg 1584 
Leu Ser Pro Val Asp Val Ala Tyr Ser Leu Gly Met Thr Arg Ala Ala 
515 520 525 
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ctg gat gaa cgc gca gtg gtg ttg ggc teg gac cgt gec gcg etc ctg 1632 
Leu Asp Glu Arg Ala Val Val Leu Gly Ser Asp Arg Ala Ala Leu Leu 
530 535 540 

acc ggg ttg agg gca ttc gec gac ggc tgc gat gcg ccc gaa gtg gtt 1680 
Thr Gly Leu Arg Ala Phe Ala Asp Gly Cys Asp Ala Pro Glu Val Val 
545 550 555 560 

teg ggg tct gtg ggg ctt ggt ggc cgc gtc ggg ttc gtg ttc teg ggt 1728 
Ser Gly Ser Val Gly Leu Gly Gly Arg Val Gly Phe Val Phe Ser Gly 
565 570 575 

cag ggt ggt cag tgg ccg ggg atg ggc egg ggg etc tac teg gtg ttt 1776 
Gin Gly Gly Gin Trp Pro Gly Met Gly Arg Gly Leu Tyr Ser Val Phe 
580 585 590 

ccg gtg ttc gee gac gcg ttc gac gag get tgc gcg gag ttg gat gca 1824 
Pro Val Phe Ala Asp Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp Ala 
595 600 605 

cac ctg ggc cag gaa ctg egg gtt egg gat gtg gtg ttc ggt teg caa 1872 
His Leu Gly Gin Glu Leu Arg Val Arg Asp Val Val Phe Gly Ser Gin 
610 615 620 

gcg tgg ttg ctg gat egg acg gtg tgg gcg cag teg ggt ttg ttc gcg 192 0 
Ala Trp Leu Leu Asp Arg Thr Val Trp Ala Gin Ser Gly Leu Phe Ala 
625 630 635 640 

ttg cag att ggc ttg ctg egg ctg ctg ggt teg tgg ggt gtt egg ccg 1968 
Leu Gin lie Gly Leu Leu Arg Leu Leu Gly Ser Trp Gly Val Arg Pro 
645 650 655 

gat gtg gtg ttg ggg cac teg gtg ggt gag ctg get gcg gtg cat gcg 2016 
Asp Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Val His Ala 
660 665 670 

get ggt gtg ttg teg ttg teg gag gee gcg egg ttg gtg gcg ggt cgc 2 064 
Ala Gly Val Leu Ser Leu Ser Glu Ala Ala Arg Leu Val Ala Gly Arg 
675 680 685 

gee egg ttg atg cag gcg ttg cct tct ggt ggt gec atg etc gcg gtc 2112 
Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala Met Leu Ala Val 
690 695 700 

get acg ggt gag ttt cag gtc gat cct ctg ctg gat ggg gtg egg gac 2160 
Ala Thr Gly Glu Phe Gin Val Asp Pro Leu Leu Asp Gly Val Arg Asp 
705 710 715 720 
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egg ate ggt ate gcg gcg gtg aat ggc ccg gaa teg gtt gtg etc tct 2208 

Arg lie Gly He Ala Ala Val Asn Gly Pro Glu Ser Val Val Leu Ser 
725 730 735 

ggt gac cgc gag ctg etc ace gag ate get gat egg ttg cac gat cag 2256 

Gly Asp Arg Glu Leu Leu Thr Glu He Ala Asp Arg Leu His Asp Gin 

740 745 750 

ggg tgc egg ace egg tgg ttg egg gtg teg cat" get ttc cat teg ccc 2304 

Gly Cys Arg Thr Arg Trp Leu Arg Val Ser His Ala Phe His Ser Pro 

755 760 765 



cat atg gag ccg atg ctg gag gag ttc gee cag ate tec cga ggc cgc 2352 

His Met Glu Pro Met Leu Glu Glu Phe Ala Gin He Ser Arg Gly Arg 

770 775 780 

gaa tat cac gca ccg gaa ctg ccg ate ate teg ace ctg ate ggt gag 2400 

Glu Tyr His Ala Pro Glu Leu Pro He He Ser Thr Leu He Gly Glu 

785 790 795 800 



ctg gac ggt ggt cga gtg atg ggc act ccc gag tac tgg gtg cgt cag 2448 
Leu Asp Gly Gly Arg Val Met Gly Thr Pro Glu Tyr Trp Val Arg Gin 
805 810 815 



gtg cgt gag ccc gtc cgt ttc gee gag ggt gtc cag gcg ctt gtc ggt 2496 
Val Arg Glu Pro Val Arg Phe Ala Glu Gly Val Gin Ala Leu Val Gly 
820 825 830 



cag ggt gtc ggc acg att gtc gaa ttg ggt ccg gac ggg gcg ttg teg 2544 
Gin Gly Val Gly Thr He Val Glu Leu Gly Pro Asp Gly Ala Leu Ser 
835 840 845 



acg ttg gtc gag gag tgt gtg gcg gaa tec ggg egg gtg gee ggg ate 2592 
Thr Leu Val Glu Glu Cys Val Ala Glu Ser Gly Arg Val Ala Gly He 
850 855 860 



ccg ctg atg cgc aag gac cgc gac 
Pro Leu Met Arg Lys Asp Arg Asp 
865 870 

ttg gcg cag ate cac ace cgt ggt 
Leu Ala Gin He His Thr Arg Gly 
885 



gag gcg cga acc gtg ctg gca get 2640 

Glu Ala Arg Thr Val Leu Ala Ala 
875 880 

ggt gag gtg gac tgg egg teg ttt 2688 

Gly Glu Val Asp Trp Arg Ser Phe 
•890 895 



ttc gec ggt acc ggg gcg aag caa gtc gac ctg ccc acc tac gee ttc 2736 
Phe Ala Gly Thr Gly Ala Lys Gin Val Asp Leu Pro Thr Tyr Ala Phe 
900 905 910 



230 



WO 01/16303 PCT/EP00/08013 

cag egg cag egg tac tgg ctg gca tec acc ggg cgt gcg ggt gac gtg 2 784 
Gin Arg Gin Arg Tyr Trp Leu Ala Ser Thr Gly Arg Ala Gly Asp Val 
915 920 925 

acc gee : gcc gga ttg gee gag gcg gac cat ccg ctg etc ggt gcg gtg " 2832 
Thr Ala Ala Gly Leu Ala Glu Ala Asp His Pro Leu Leu Gly Ala Val 
930 935 940 

gtt gcg ttg gca gac ggc gaa ggt gtg gtg ctg acc ggt egg ttg aca 2880 
Val Ala Leu Ala Asp Gly Glu Gly Val Val Leu Thr Gly Arg Leu Thr 
945 950 955 960 

gcg ggt teg cat ccg tgg ttg tec gat cac egg gtg ctg ggc gaa ate 2928 
Ala Gly Ser His Pro Trp Leu Ser Asp His Arg Val Leu Gly Glu lie 
965 970 975 

gtc gtc ccc ggc acc gcg ate gtc gag ctg gtg tgg cac gtc ggc gag 2976 
Val Val Pro Gly Thr Ala He Val Glu Leu Val Trp His Val Gly Glu 
980 985 990 

cgc etc ggt tgt ggc egg gtg gaa gaa ctg get ttg gaa gcg ccc ctg 3024 
Arg Leu Gly Cys Gly Arg Val Glu Glu Leu Ala Leu Glu Ala Pro Leu 
995 1000 1005 

ate ctg ccg gat cat gga gcg gtc cag gtt cag gtg ctg gtg gga ccg 3 072 
He Leu Pro Asp His Gly Ala Val Gin Val Gin Val Leu Val Gly Pro 
1010 1015 1020 

ccc ggg gaa tec gga gee egg teg gtg gcg etc tac tec tgt cct ggc 3120 
Pro Gly Glu Ser Gly Ala Arg Ser Val Ala Leu Tyr Ser Cys Pro Gly 
1025 1030 1035 1040 

gag gcg ate gaa ccc gag tgg aag aag cac gcg acg ggc gtg ctt etc 3168 
Glu Ala He Glu Pro Glu Trp Lys Lys His Ala Thr Gly Val Leu Leu 
1045 1050 1055 

cca ccc gtg gec gee gag aac cat gag ctg acc gca tgg ccc ccg gag 3216 
Pro Pro Val Ala Ala Glu Asn His Glu Leu Thr Ala Trp Pro Pro Glu 
1060 1065 1070 

aat gcg acc gaa ate gat gca gac ggg gtc tac gca ttc ctt gaa ggg 3264 
Asn Ala Thr Glu He Asp Ala Asp Gly Val Tyr Ala Phe Leu Glu Gly 
1075 1080 1085 

cac ggt ttc gcg tac gga ccg gee ttt aga tgt ctg cgc ggt gee tgg 3312 
His Gly Phe Ala Tyr Gly Pro Ala Phe Arg Cys Leu Arg Gly Ala Trp 
1090 1095 1100 



231 



WO 01/16303 



PCT/EP00/08013 



cga cga ggc ggg gag gtg ttc gcc gaa gtc gca ttg ccg gat gac atg 
Arg Arg Gly Gly Glu Val Phe Ala Glu Val Ala Leu Pro Asp Asp Met 
1105 1110 1115 1120 



3360 



cag gcg ggg gtc gat cga ttc ggc gtc cac ccc gcg ttg ctg gac gcg 
Gin Ala Gly Val Asp Arg Phe Gly Val His Pro Ala Leu Leu Asp Ala 
1125 1130 1135 



3408 



gtt ctg cat gcc gcc gca gcc gag acg teg gtg gtc cag age gaa gcg 
Val Leu His Ala Ala Ala Ala Glu Thr Ser Val Val Gin Ser Glu Ala 
1140 1145 1150 



3456 



egg gtg ccg ttc teg tgg cgt ggg gtg gaa ctt cgc gcc act gaa age 
Arg Val Pro Phe Ser Trp Arg Gly Val Glu Leu Arg Ala Thr Glu Ser 
1155 1160 . 1165 



3504 



gcg gtg gtg egg gcg cgc etc teg ttg act teg gat gac gaa ctg teg 
Ala Val Val Arg Ala Arg Leu Ser Leu Thr Ser Asp Asp Glu Leu Ser 
1170 1175 1180 



3552 



ttg gtc gca gtg gac ccg get ggc cga ttc gtg gcc acg gtt gat teg 
Leu Val Ala Val Asp Pro Ala Gly Arg Phe Val Ala Thr Val Asp Ser 
1185 1190 1195 1200 



3600 



ctg gtg ace cga ccg ate tec egg cag cag gtg agg tct ggc gcg ate 
Leu Val Thr Arg Pro lie Ser Arg Gin Gin Val Arg Ser Gly Ala lie 
1205 1210 1215 



3648 



ggt gat tgc ctg ttc gag gtg gag tgg cac egg aag gcg ttg ttg gga 
Gly Asp Cys Leu Phe Glu Val Glu Trp His Arg Lys Ala Leu Leu Gly 
1220 1225 1230 



3696 



aca ace gcc ggc gac gac ctt gcc ate gtc ggt gac ggt ccc agt tgg 
Thr Thr Ala Gly Asp Asp Leu Ala He Val Gly Asp Gly Pro Ser Trp 
1235 1240 1245 



3744 



ccg gaa teg gtg cgc gca ace gca egg ttc gcg ace ctg gat gag ttc 
Pro Glu Ser Val Arg Ala Thr Ala Arg Phe Ala Thr Leu Asp Glu Phe 
1250 1255 1260 



3792 



cgt gcg gcc gtg gac teg gac gtt cct gcc ccg ggt teg gtg ttg gtc 
Arg Ala Ala Val Asp Ser Asp Val Pro Ala Pro Gly Ser Val Leu Val 
1265 1270 1275 1280 



3840 



gca get atg teg gcc gaa gag gtc gag ggt gga tec ctg ccg teg cgc 
Ala Ala Met Ser Ala Glu Glu Val Glu Gly Gly Ser Leu Pro Ser Arg 
1285 1290 1295 



3888 
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gcc caa gag teg acc tec gat ctg ctg get etc gtg cag teg tgg ctt 
Ala Gin Glu Ser Thr Ser Asp Leu Leu Ala Leu Val Gin Ser Trp Leu 
1300 1305 1310 



3936 



gcg gac gag egg ttc gcc gaa tec cag etc gtg* gtc gtc acg cgt gca 
Ala Asp Glu Arg Phe Ala Glu Ser Gin Leu Val Val Val Thr Arg Ala 
1315 1320 1325 



3984 



gcg gtg teg gcc gac teg gat teg gac gtc gcg gac ctg gtg ggt gcg 
Ala Val Ser Ala Asp Ser Asp Ser Asp Val Ala Asp Leu Val Gly Ala 
1330 1335 1340 



4032 



teg teg tgg ggg ttg ttg agt tea gcc cag teg gag aac ccg ggt cgc 
Ser Ser Trp Gly Leu Leu Ser Ser Ala Gin Ser Glu Asn Pro Gly Arg 
1345 1350 1355 1360 



4080 



ttc gtg ctg gtg gac gtg gac ggc aca cct gag teg tgg cag gcg ttg 
Phe Val Leu Val Asp Val Asp Gly Thr Pro Glu Ser Trp Gin Ala Leu 
1365 1370 1375 



4128 



ccg gcc gcc gtg cga gca gga gaa ccg cag ctg gca ctt egg cgc ggc 
Pro Ala Ala Val Arg Ala Gly Glu Pro Gin Leu Ala Leu Arg Arg Gly 
1380 1385 1390 



4176 



gtg gcg ctg gtg cct egg ttg gcg cga etc acg gtg cgc gag gag ggc 
Val Ala Leu Val Pro Arg Leu Ala Arg Leu Thr Val Arg Glu Glu Gly 
1395 1400 1405 



4224 



tec tec ccg caa etc gac acg gac ggg acc gtc etc ate acg ggt ggc 
Ser Ser Pro Gin Leu Asp Thr Asp Gly Thr Val Leu He Thr Gly Gly 
1410 1415 1420 



4272 



acc ggt gcg ttg ggg gga gtg gtt gcc cgt cac ctg gtg gag gag cac 
Thr Gly Ala Leu Gly Gly Val Val Ala Arg His Leu Val Glu Glu His 
1425 1430 1435 1440 



4320 



ggg att egg cgt ttg gtg ttg gca ggc egg cgt ggc tgg aat gcg cct 
Gly He Arg Arg Leu Val Leu Ala Gly Arg Arg Gly Trp Asn Ala Pro 
1445 1450 1455 



4368 



gga gtc cac gag ttg gtg gat gag ctg gcg cgc gcg ggc gcc gtg gtt 
Gly Val His Glu Leu Val Asp Glu Leu Ala Arg Ala Gly Ala Val Val 
1460 1465 1470 



4416 



gag gtg gtg get tgc gat gtg get gac cgc acc gat ctg gag cac gtg 
Glu Val Val Ala Cys Asp Val Ala Asp Arg Thr Asp Leu Glu His Val 
1475 1480 1485 



4464 
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ctg gcc gcc att ccg gtc gac tgg ccg ctg egg ggg ate gtg cat ace 4512 
Leu Ala Ala lie Pro Val Asp Trp Pro Leu Arg Gly He Val His Thr 
1490 1495 1500 

get ggg gtg ctg gcc gac gga gtg ate ggg tec ttg teg gcg gcg gat -4560 
Ala Gly Val Leu Ala Asp Gly Val He Gly Ser Leu Ser Ala Ala Asp 
1505 1510 1515 1520 

gtg ggc acg gtg ttt gcc ccg aag gtg acg ggg gca tgg cat ctg cac 4608 
Val Gly Thr Val Phe Ala Pro Lys Val Thr Gly Ala Trp His Leu His 
1525 1530 1535 

gag ttg ace cgc gat ctg gat ctg teg ttc ttc gtt ctt ttc tct tec 4656 
Glu Leu Thr Arg Asp Leu Asp Leu Ser Phe Phe Val Leu Phe Ser Ser 
1540 1545 1550 

ttc tec ggg att gcg ggt gcc gca ggg cag gcc aac tac gcg gcg gcg 4704 
Phe Ser Gly He Ala Gly Ala Ala Gly Gin Ala Asn Tyr Ala Ala Ala 
1555 1560 1565 

aac acg ttc ctg gat gca ttg gcg cgt tat cgc egg gcg cgt ggg ctg 4752 
Asn Thr Phe Leu Asp Ala Leu Ala Arg Tyr Arg Arg Ala Arg Gly Leu 
1570 1575 1580 

cct ggg ttg teg ttg gcg tgg gga ctg tgg gcg caa ccc age ggt atg 4 800 
Pro Gly Leu Ser Leu Ala Trp Gly Leu Trp Ala Gin Pro Ser Gly Met 
1585 1590 1595 1600 

acg agt ggc ttg gac gcg gcg teg gtg gag egg ttg gcg egg acg ggc 4 84 8 
Thr Ser Gly Leu Asp Ala Ala Ser Val Glu Arg Leu Ala Arg Thr Gly 
1605 1610 1615 

ate gca gaa ctt tec acg gag gat gga etc cgc ctg ttc gat gcc gcg 4 896 
He Ala Glu Leu Ser Thr Glu Asp Gly Leu Arg Leu Phe Asp Ala Ala 
1620 1625 1630 

ttc gcg aag gac egg get tgc gtc gtt gcc get cga ttg gac agg gcg 4 944 
Phe Ala Lys Asp Arg Ala Cys Val Val Ala Ala Arg Leu Asp Arg Ala 
1635 1640 1645 

ctg ctg gtc ggg aac gga cga teg cac gcg att ccg gcg ctg ttg age 4992 
Leu Leu Val Gly Asn Gly Arg Ser His Ala He Pro Ala Leu Leu Ser 
1650 1655 1660 

gcg ttg gtt cct gtt cgc ggc ggt gtg gcg agg aaa aca gcc aat tct 5040 
Ala Leu Val Pro Val Arg Gly Gly Val Ala Arg Lys Thr Ala Asn Ser 
1665 1670 1675 1680 
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cag gcc gcg gat gag gac gca ctg ttg ggt ttg gtg egg gag cac gtt 
Gin Ala Ala Asp Glu Asp Ala Leu Leu Gly Leu Val Arg Glu His Val 
1685 1690 1695 



5088 



teg gcc gtg ctg ggt tat teg ggt gcg gtc gag gtt ggg ggc gac cgt 
Ser Ala Val Leu Gly Tyr Ser Gly Ala Val Glu Val Gly Gly Asp Arg 
1700 1705 1710 



5136 



get ttc cgt gat ctg ggt ttt gat teg ttg tct ggc gtg gag ttg egg 
Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly Val Glu Leu Arg 
1715 1720 1725 



5184 



aac cgc ctt gcc ggg gtg ctg ggg gtg egg ttg ccg gcg act gcg gtg 
Asn Arg Leu Ala Gly Val Leu Gly Val Arg Leu Pro Ala Thr Ala Val 
1730 1735 1740 



5232 



ttc gac tat ccg acg ccg egg gcg ctg gcg cgt ttc ctg cat cag gaa 
Phe Asp Tyr Pro Thr Pro Arg Ala Leu Ala Arg Phe Leu His Gin Glu 
1745 1750 1755 1760 



5280 



ctg gca ggc gag gtc gcg tec acg teg acg ccg gtg ace agg gca gcg 
Leu Ala Gly Glu Val Ala Ser Thr Ser Thr Pro Val Thr Arg Ala Ala 
1765 1770 1775 



5328 



agt gcc gaa gag gat ctt gtt gcg att gtc ggg atg gga tgt cgt ttt 
Ser Ala Glu Glu Asp Leu Val Ala lie Val Gly Met Gly Cys Arg Phe 
1780 1785 1790 



5376 



ccg ggt ggg gtg teg teg ccg gag gag ctt tgg egg ctg gtg gcc ggc 
Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala Gly 
1795 1800 1805 



5424 



ggc gtg gat gcg gtg get ggg ttc cca gac gat cgc ggc tgg gat etc 
Gly Val Asp Ala Val Ala Gly Phe Pro Asp Asp Arg Gly Trp Asp Leu 
1810 1815 1820 



5472 



gcg gcg ttg tac gat cct gat ccc gat cgt etc ggg ace teg tat gtg 
Ala Ala Leu Tyr Asp Pro Asp Pro Asp Arg Leu Gly Thr Ser Tyr Val 
1825 1830 1835 1840 



5520 



tgt gag ggc ggg ttt ctg egg gac gcg gcg gag ttc gat get gac atg 
Cys Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala Asp Met 
1845 1850 1855 



5568 



ttc ggc ate age ccg cgt gag gcg ttg gcg atg gat ccg cag cag egg 
Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
I860 1865 1870 



5616 
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ttg ctg ctg gag gtcgcc tgg gaa acc ttg gag egg get ggg ate gat 
Leu Leu Leu Glu Val Ala Trp Glu Thr Leu Glu Arg Ala Gly lie Asp 
1875 1880 1885 



5664 



ccg ttc teg ttg cac ggc age egg acc ggt gtg ttc gcg ggc ttg atg 
Pro Phe Ser Leu His Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met 
1890 1895 1900 



5712 



tac cac gac tat ggg gee cga ttc att acc aga gca ccg gag ggc ttc 
Tyr His Asp Tyr Gly Ala Arg Phe He Thr Arg Ala Pro Glu Gly Phe 
1905 1910 1915 1920 



5760 



gaa ggg cac etc ggg acg ggc aat gcg ggg age gtg ctg teg ggt egg 
Glu Gly His Leu Gly Thr Gly Asn Ala Gly Ser Val Leu Ser Gly Arg 
1925 1930 1935 



5808 



gtt gcg tat teg ttt ggt ttc gag ggt cct gcg gtg acg gtg gat acg 
Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr 
1940 1945 1950 



5856 



gcg tgt teg teg teg ttg gtg gcg tta cac ctg gcg ggt caa gca ctg 
Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Gly Gin Ala Leu 
1955 I960 1965 



5904 



egg gec ggt gag tgc gaa ttc gec ctt gee ggt ggc gtc acg gtg atg 
Arg Ala Gly Glu Cys Glu Phe Ala Leu Ala Gly Gly Val Thr Val Met 
1970 1975 1980 



5952 



teg acg ccg acg acg ttc gtg gag ttc tec cgt caa egg ggt ctg get 
Ser Thr Pro Thr Thr Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala 
1985 1990 1995 2000 



6000 



ccg gat ggg egg tgc aag teg ttc gcg gcg gee gcg gat ggc acc ggg 
Pro Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly 
2005 2010 2015 



6048 



tgg ggc gag ggt gee ggt ctg gtg ttg ctg gag egg ttg teg gat gee 
Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala 
2020 2025 2030 



6096 



egg cgc aat ggg cac gag gtt ctg gcg gtg gtg egg ggt age gcg gtg 
Arg Arg Asn Gly His Glu Val Leu Ala Val Val Arg Gly Ser Ala Val 
2035 2040 2045 



6144 



aac cag gac ggc gcg teg aat ggc ttg act gcg cca aat ggt ccg tea 
Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser 
2050 2055 2060 



6192 
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cag caa agg gtg ate acc cag gca etc acg agt gec ggg ctg tec gtg 
Gin Gin Arg Val lie Thr Gin Ala Leu Thr Ser Ala Gly Leu Ser Val 
2065 2070 2075 2080 



6240 



tec gac gtg gat get gtg gag gcg cat ggg acg ggc acg egg ctt ggt 
Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
2085 2090 2095 



6288 



gat ccg ate gag gcg cag gcg ttg ate get acg tac ggc egg gat cgt 
Asp Pro He Glu Ala Gin Ala Leu He Ala Thr Tyr Gly Arg Asp Arg 
2100 2105 2110 



6336 



gat ccc ggt egg ccg ttg tgg ctg ggg teg gtg aag teg aat att ggt 
Asp Pro Gly Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn He Gly 
2115 2120 2125 



6384 



cac acc cag gcg gcg gcg ggt gtc get ggt gtg ate aag atg gtg atg 
His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Met 
2130 2135 2140 



6432 



gcg atg egg cag ggg gag ctg ccg cgc acg ttg cac gtg gac gag ccc 
Ala Met Arg Gin Gly Glu Leu Pro Arg Thr Leu His Val Asp Glu Pro 
2145 2150 2155 2160 



6480 



tec gcg cag gtg gac tgg tct gcg ggc acg gtc caa etc etc acg gag 
Ser Ala Gin Val Asp Trp Ser Ala Gly Thr Val Gin Leu Leu Thr Glu 
2165 2170 2175 



6528 



aac acg ccc tgg ccc gac age ggt cgt ctt cgc egg gcg ggc gtg tea 
Asn Thr Pro Trp Pro Asp Ser Gly Arg Leu Arg Arg Ala Gly Val Ser 
2180 2185 2190 



6576 



teg ttc ggg ate agt ggc acc aac gcg cac ctg ate ctt gaa caa cct 
Ser Phe Gly lie Ser Gly Thr Asn Ala His Leu He Leu Glu Gin Pro 
2195 2200 2205 



6624 



ccg cga gag teg cag cgc tea aca gag ccg gat teg ggt tct gtc cgc 
Pro Arg Glu Ser Gin Arg Ser Thr Glu Pro Asp Ser Gly Ser Val Arg 
2210 2215 2220 



6672 



gat ttt ccg gtg gtg ccg tgg atg gtg teg ggc aaa aca ccc gaa gcg 
Asp Phe Pro Val Val Pro Trp Met Val Ser Gly Lys Thr Pro Glu Ala 
2225 2230 2235 2240 



6720 



eta tec gee cag gca gat gca ttg atg tec tac ttg age aat cgc gtt 
Leu Ser Ala Gin Ala Asp Ala Leu Met Ser Tyr Leu Ser Asn Arg Val 
2245 2250 2255 



6768 
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gat get tec ccg cga gat ate ggt tat teg ctt gcg gtg ace cgt ccg 6816 
Asp Ala Ser Pro Arg Asp He Gly Tyr Ser Leu Ala Val Thr Arg Pro 
2260 2265 2270 

gcg ttg gac cac cgc get gtc gtg ctg ggt gcg gat cgt gee gcg ttg 6864 
Ala Leu Asp His Arg Ala Val Val Leu Gly Ala Asp Arg Ala Ala Leu 
2275 2280 2285 

ctg ccg ggc ttg aaa gcg ctg gee gtt agt aat gac get gee gag gtg 6912 
Leu Pro Gly Leu Lys Ala Leu Ala Val Ser Asn Asp Ala Ala Glu Val 
2290 2295 2300 

ate ace ggc act cgt gee get ggg ccg gtc gga ttc gtg ttc tec ggt 6960 
He Thr Gly Thr Arg Ala Ala Gly Pro Val Gly Phe Val Phe Ser Gly 
2305 2310 2315 2320 

caa ggt ggt cag tgg ccc ggg atg gga age ggg etc cac teg gcg ttt 7008 
Gin Gly Gly Gin Trp Pro Gly Met Gly Ser Gly Leu His Ser Ala Phe 
2325 2330 2335 

ccg gtg ttc gee gac gcg ttt gac gaa gee tgc tgc gag ctg gat gcg 7056 
Pro Val Phe Ala Asp Ala Phe Asp Glu Ala Cys Cys Glu Leu Asp Ala 
2340 2345 2350 



cat etc ggg cag atg gee egg eta cga gat gtg ttg tec ggt teg gat 7104 
His Leu Gly Gin Met Ala Arg Leu Arg Asp Val Leu Ser Gly Ser Asp 
2355 2360 2365 

acg caa ctt ctg gac cag ace ttg tgg gcg cag ccg ggc ctg ttc gcg 7152 
Thr Gin Leu Leu Asp Gin Thr Leu Trp Ala Gin Pro Gly Leu Phe Ala 
2370 2375 2380 

ttg caa gtc gga etc tgg gag ttg ttg ggt teg tgg ggt gtc egg ccc 7200 
Leu Gin Val Gly Leu Trp Glu Leu Leu Gly Ser Trp Gly Val Arg Pro 
2385 2390 2395 2400 



get gtg gtg ctg ggc cac teg gtc ggt gag ctg gcg gcg gcg ttc gcg 724 8 
Ala Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala Phe Ala 
2405 2410 2415 



get gga gtg ttg teg ttg egg gat gcg get egg ctg gtg gcg ggc cgt 72 96 
Ala Gly Val Leu Ser Leu Arg Asp Ala Ala Arg Leu Val Ala Gly Arg 
2420 2425 2430 

gee egg ttg atg caa gee ctg cca act ggc ggt gee atg etc get gcg 7344 
Ala Arg Leu Met Gin Ala Leu Pro Thr Gly Gly Ala Met Leu Ala Ala 
2435 2440 2445 
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get get gga gag gag cag ctg cgc ccg ttg ctg gec gac tgc ggt gat 7392 
Ala Ala Gly Glu Glu Gin Leu Arg Pro Leu Leu Ala Asp Cys Gly Asp 
2450 2455 2460 



cgt gtg ggg ate gec gcg gtc aac 
Arg Val Gly lie Ala Ala Val Asn 
2465 2470 

ggt gat egg gat gtg etc gat gac 
Gly Asp Arg Asp Val Leu Asp Asp 
2485 



get dec ggg teg gtg gtg etc tec 744 0 
Ala Pro Gly Ser Val Val Leu Ser 
2475 2480 

att gee ggt egg ctg gac ggg caa 7488 
lie Ala Gly Arg Leu Asp Gly Gin 
2490 2495 



ggg ate egg tec agg tgg ttg egg gtt teg cat gcg ttt cat teg cat 7536 
Gly lie Arg Ser Arg Trp Leu Arg Val Ser His Ala Phe His Ser His 
2500 2505 2510 

egg atg gat ccg atg ctg gcg gag ttc acc gaa ate gec egg age gtg 7584 
Arg Met Asp Pro Met Leu Ala Glu Phe Thr Glu lie Ala Arg Ser Val 
2515 2520 2525 



gac tac egg teg tea ggg ctg ccg ate gtg teg acg ttg acg ggt gag 7632 

Asp Tyr Arg Ser Ser Gly Leu Pro lie" Val Ser Thr Leu Thr Gly Glu 
2530 2535 2540 

etc gat gag gtc ggc atg ccg get acg ccg gag tat tgg gtg cgc cag 7680 

Leu Asp Glu Val Gly Met Pro Ala Thr Pro Glu Tyr Trp Val Arg Gin 
2545 2550 2555 2560 

gtg cga gaa ccc gtc cgc ttc gee gac ggt gtt get gcg etc gcg get 7728 

Val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Ala Ala Leu Ala Ala 
2565 2570 2575 



cac ggt gtg age acc gtc gtc gag gtc ggt ccg gat ggg gtg ttg teg 7776 
His Gly Val Ser Thr Val Val Glu Val Gly Pro Asp Gly Val Leu Ser 
2580 2585 2590 

gcg ctg gtg cag gag tgc gcg gec gga tec gat cag ggc gga egg gtg 7824 
Ala Leu Val Gin Glu Cys Ala Ala Gly Ser Asp Gin Gly Gly Arg Val 
2595 2600 2605 



gec gcg gtt ccg etc atg cgc age aat cgc gac gag gcg cac acg gtg 7 872 

Ala Ala Val Pro Leu Met Arg Ser Asn Arg Asp Glu Ala His Thr Val 
2610 2615 2620 

aca acg gca ttg gcg cag ate cat gtg cgt ggt get gag gtg gac tgg 7 920 

Thr Thr Ala Leu Ala Gin He His Val Arg Gly Ala Glu Val Asp Trp 
2625 2630 2635 2640 
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egg teg ttt ttc gec ggt acc ggg gca aag cag gtc gag ctg ccc acg 
Arg Ser Phe Phe Ala Gly Thr Gly Ala Lys Gin Val Glu Leu Pro Thr 
2645 2650 2655 



7968 



tat gec ttc caa cga cag egg tac tgg ctt gac tea cca tec gaa ccg 
Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Asp Ser Pro Ser Glu Pro 
2660 2665 2670 . 



8016 



gtc ggg caa tec gee gat ccc gcg cgc cag teg ggc ttc tgg gaa etc 
Val Gly Gin Ser Ala Asp Pro Ala Arg Gin Ser Gly Phe Trp Glu Leu 
2675 2680 2685 



8064 



gtc gag cag gaa gat gtc age gcg etc age gec get ctg cac att acc 
Val Glu Gin Glu Asp Val Ser Ala Leu Ser Ala Ala Leu His lie Thr 
2690 2695 2700 



8112 



ggc gat cac gac gtg cag gcg tec ctg gaa teg gtg gtt ccg gtc etc 
Gly Asp His Asp Val Gin Ala Ser Leu Glu Ser Val Val Pro Val Leu 
2705 2710 2715 2720 



8160 



tec tec tgg cat cgc egg ate cgc aac gaa tec ctg gtg cac cag tgg 
Ser Ser Trp His Arg Arg lie Arg Asn Glu Ser Leu Val His Gin Trp 
2725 2730 2735 



8208 



egg tac egg att tec tgg cat gag egg gca gat ttg cca gac ccc teg 
Arg Tyr Arg He Ser Trp His Glu Arg Ala Asp Leu Pro Asp Pro Ser 
2740 2745 2750 



8256 



ttg teg ggg aca tgg etc gtc gtc gtg ccg gag ggg tgg teg gcg agt 
Leu Ser Gly Thr Trp Leu Val Val Val Pro Glu Gly Trp Ser Ala Ser 
2755 2760 2765 



8304 



egg caa gtt ctg cgt ttc aac gag atg ttc gag gaa egg ggt tgc ccg 
Arg Gin Val Leu Arg Phe Asn Glu Met Phe Glu Glu Arg Gly Cys Pro 
2770 2775 2780 



8352 



gca gtt ctg ttc gag etc gee ggg cac gac gag gaa gec ctg gcg caa 
Ala Val Leu Phe Glu Leu Ala Gly His Asp Glu Glu Ala Leu Ala Gin 
2785 2790 2795 ' 2800 



8400 



cga ttc cgc teg ttg cct gtt gcg tea ggg gga ata age ggc gtg ttg 
Arg Phe Arg Ser Leu Pro Val Ala Ser Gly Gly He Ser Gly Val Leu 
2805 2810 2815 



8448 



tec ttg ctg gcg ctg gat gaa teg ccg tec teg ccg aac get get ttg 
Ser Leu Leu Ala Leu Asp Glu Ser Pro Ser Ser Pro Asn Ala Ala Leu 
2820 2825 2830 



8496 
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ccg aat ggc gcg ctg aac teg ttg gta ctg ctg cga get ctg egg gee 
Pro Asn Gly Ala Leu Asn Ser Leu Val Leu Leu Arg Ala Leu Arg Ala 
2835 2840 2845 



8544 



gcg gat gtg teg gcg cca ttg tgg ttg gcg acg tgt ggt ggt gtc gcg 
Ala Asp Val Ser Ala Pro Leu Trp Leu Ala Thr Cys Gly Gly Val Ala 
2850 2855 2860 



8592 



gtc ggg gat gtg ccg gtg aac ccg ggg cag gcg ctg gtg tgg gga ctg 
Val Gly Asp Val Pro Val Asn Pro Gly Gin Ala Leu Val Trp Gly Leu 
2865 2870 2875 2880 



8640 



ggt cgc gtc gtc ggt ctg gag cat ccg gec tgg tgg ggt ggc ctg gtc 
Gly Arg Val Val Gly Leu Glu His Pro Ala Trp Trp Gly Gly Leu Val 
2885 2890 2895 



8688 



gac gtg ccg tgc ttg etc gat gag gac get cga gaa cgc ttg teg gtc 
Asp Val Pro Cys Leu Leu Asp Glu Asp Ala Arg Glu Arg Leu Ser Val 
2900 2905 2910 



8736 



gtg ttg gca ggt ctt ggc gag gac gag ate gcg gta cgt ccc ggt ggt 
Val Leu Ala Gly Leu Gly Glu Asp Glu He Ala Val Arg Pro Gly Gly 
2915 2920 2925 



8784 



gtg ttc gtg egg egg ttg gaa cgc get. ggt gcg gcg teg ggt gee ggg 8832 
Val Phe Val Arg Arg Leu Glu Arg Ala Gly Ala Ala Ser Gly Ala Gly ■ 
2930 2935 2940 

teg gtg tgg cgt cct egg ggg acg gtg ttg gtg acg ggt ggt acg ggc 8880 
Ser Val Trp Arg Pro Arg Gly Thr Val Leu Val Thr Gly Gly Thr Gly 
2945 2950 2955 2960 

ggt ttg ggg gcg cat gtt gee egg tgg ttg gcg ggt gee ggg get gag 8928 
Gly Leu Gly Ala His Val Ala Arg Trp Leu Ala Gly Ala Gly Ala Glu 
2965 2970 2975 

cat gtg gtg ttg ace age cgt cga ggc gcg gcg get ccg ggc get gga 8976 
His Val Val Leu Thr Ser Arg Arg Gly Ala Ala Ala Pro Gly Ala Gly 
2980 2985 2990 

gat ttg egg gcg gag ctg gag gcg ctg ggc get egg gtt teg ate acg 9024 
Asp Leu Arg Ala Glu Leu Glu Ala Leu Gly Ala Arg Val Ser He Thr 
2995 3000 3005 

gee tgc gac gtg gee gat cgt gac get ttg gee gaa gtg ttg gcg ace 9072 
Ala Cys Asp Val Ala Asp Arg Asp Ala Leu Ala Glu Val Leu Ala. Thr 
3010 3015 3020 
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att ccg gat gat tgc ccg ctg acc gcg gtg atg cat gcg gcg ggg gtc 
lie Pro Asp Asp Cys Pro Leu Thr Ala Val Met His Ala Ala Gly Val 
3025 * 3030 3035 3040 



9120 



gtt gaa gtc ggc gac gtg gcg teg atg tgt ttg acc gac ttc gtt ggg 
Val Glu Val Gly Asp Val Ala Ser Met Cys Leu Thr Asp Phe Val Gly 
3045 3050 3055 



9168 



gtg ctg teg gcg aag gca ggt ggt gcg gcg aat etc gat gag ttg etc 
Val Leu Ser Ala Lys Ala Gly Gly Ala Ala Asn Leu Asp Glu Leu Leu 
3060 3065 3070 



9216 



gee gat gtc gag ctg gat gec ttc gtg ctg ttc tea tec gtc teg ggt 
Ala Asp Val Glu Leu Asp Ala Phe Val Leu Phe Ser Ser Val Ser Gly 
3075 3080 3085 



9264 



gtg tgg ggt get ggc ggg cag ggc get tat gcg gcg gcg aat gee tac 
Val Trp Gly Ala Gly Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Tyr 
3090 3095 3100 



9312 



ttg gat gcg ttg gcg cag cag cgt egg gca agg ggg ttg gtg ggg act 
Leu Asp Ala Leu Ala Gin Gin Arg Arg Ala Arg Gly Leu Val Gly Thr 
3105 3110 3115 3120 



9360 



gcg gtt gcg tgg ggc ccg tgg gee ggt gac gga atg gec gca ggt gaa 
Ala Val Ala Trp Gly Pro Trp Ala Gly Asp Gly Met Ala Ala Gly Glu 
3125 3130 3135 



9408 



ggc ggt gca cag ctg cgc egg gec ggc ctg gtg cca atg get gcg gat 
Gly Gly Ala Gin Leu Arg Arg Ala Gly Leu Val Pro Met Ala Ala Asp 
3140 ' 3145 3150 



9456 



egg gcg ttg ctg gca ctt cag ggc gca ttg gat cgt gac gag aca tec 
Arg Ala Leu Leu Ala Leu Gin Gly Ala Leu Asp Arg Asp Glu Thr Ser 
3155 3160 3165 



9504 



ctg gtc gtg gee gat atg gcg tgg gag agg ttc gee ccg gtg ttc gee 
Leu Val Val Ala Asp Met Ala Trp Glu Arg Phe Ala Pro Val Phe Ala 
3170 3175 3180 



9552 



atg tec cgt egg cgt ccg ctg etc gac gag ctg ccc gaa gca cag cag 
Met Ser Arg Arg Arg Pro Leu Leu Asp Glu Leu Pro Glu Ala Gin Gin 
3185 3190 3195 3200 



9600 



gcg ttg gcg gat gcg gag aac acc act gat get gcg gac teg gec gtc 
Ala Leu Ala Asp Ala Glu Asn Thr Thr Asp Ala Ala Asp Ser Ala Val 
3205 3210 3215 



9648 
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ccg eta ccg egg etc gcg ggc atg gca gee gee gaa cgc cgc cgc gcg 
Pro Leu Pro Arg Leu Ala Gly Met Ala Ala Ala Glu Arg Arg Arg Ala 
3220 3225 3230 



9696 



atg ctg gac ctg gtg ctg gcg gag gee teg att gtg ttg g$a cac aac 
Met Leu Asp Leu Val Leu Ala Glu Ala Ser lie Val Leu Gly His Asn 
3235 3240 3245 



9744 



ggg tct gac cca gtt ggt ccc gac egg gcg ttc cag gag etc gga ttt 
Gly Ser Asp Pro Val Gly Pro Asp Arg Ala Phe Gin Glu Leu Gly Phe 
3250 3255 3260 



9792 



gat teg ctg atg gee gtc gaa ctg cgc aac agg ttg ggc gag gca aca 
Asp Ser Leu Met Ala Val Glu Leu Arg Asn Arg Leu Gly Glu Ala Thr 
3265 3270 3275 3280 



9840 



gga ttg agt ctg ccg gee acg ttg ate ttc gat tat ccg age cca tec 
Gly Leu Ser Leu Pro Ala Thr Leu lie Phe Asp Tyr Pro Ser Pro Ser 
3285 3290 3295 



9888 



gcg ctg get gag cag ctg gtc ggc gag ctg gtg gga gcg cag ccc gcg 
Ala Leu Ala Glu Gin Leu Val Gly Glu Leu Val Gly Ala Gin Pro Ala 
3300 3305 3310 



9936 



ace acc gtc gtg gee ggg gee gat cca gtg gat gat ccg gtt gtc gtg 
Thr Thr Val Val Ala Gly Ala Asp Pro Val Asp Asp Pro Val Val Val 
3315 3320 3325 



9984 



gtc gcg atg gga tgc egg tat ccg ggc gac gtc tgc teg ccc gag gag 
Val Ala Met Gly Cys Arg Tyr Pro Gly Asp Val Cys Ser Pro Glu Glu 
3330 3335 3340 



10032 



ctg tgg cag ctg gtt tct gcg gga cgt gat gcg gta teg acg ttc ccc 
Leu Trp Gin Leu Val Ser Ala Gly Arg Asp Ala Val Ser Thr Phe Pro 
3345 3350 3355 3360 



10080 



gtc gat egg ggt tgg gac tgc aac acg ttg ttc gac ccg gat ccg gat 
Val Asp Arg Gly Trp Asp Cys Asn Thr Leu Phe Asp Pro .Asp Pro Asp 
3365 3370 3375 



10128 



egg gca ggc agt acc tat gtg cga gaa ggt gee ttc ctg acc ggt get 
Arg Ala Gly Ser Thr Tyr Val Arg Glu Gly Ala Phe Leu Thr Gly Ala 
3380 3385 3390 



10176 



gat egg ttc gac gee ggg ttc ttc ggc ate age cct cgc gag gcg cgc 
Asp Arg Phe Asp Ala Gly Phe Phe Gly He Ser Pro Arg Glu Ala Arg 
3395 3400 3405 



10224 
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gca atg gat ccg cag cag agg ttg ttg etc gaa gtg gcg tgg gag gtt 
Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Val Ala Trp Glu Val 
3410 3415 3420 



10272 



ttc gaa cga gca gga ate gpt ccg ctg teg ttg egg ggt age agg acc 
Phe Glu Arg Ala Gly lie Ala Pro Leu Ser Leu Arg Gly Ser Arg Thr 
3425 3430 3435 3440 



10320 



ggt gtg ttc gcg ggg acc aat ggg cag gac cac ggt gcg aaa gtg get 
Gly Val Phe Ala Gly Thr Asn Gly Gin Asp His Gly Ala Lys Val Ala 
3445 3450 3455 



10368 



gec gcg ccg gag gcg gcg ggt cac etc ctg acc gga aac gee gcg agt 
Ala Ala Pro Glu Ala Ala Gly His Leu Leu Thr Gly Asn Ala Ala Ser 
3460 3465 3470 



10416 



gtc ctg gee ggc egg ctt tec tac acg ttc ggc ctt gag ggg cct gcg 
Val Leu Ala Gly Arg Leu Ser Tyr Thr Phe Gly Leu Glu Gly Pro Ala 
3475 3480 3485 



10464 



gtg gcg gtg gat acc gcg tgt teg teg teg ttg gtg gcg ttg cat ttg 
Val Ala Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu 
3490 3495 3500 



10512 



gcg tgc cag teg ctg cgt teg ggt gag tgt gat atg gcg ttg gca ggt 
Ala Cys Gin Ser Leu Arg Ser Gly Glu Cys Asp Met Ala Leu Ala Gly 
3505 3510 3515 3520 



10560 



ggt gtg acg gtg atg teg aca ccc ctg get ttc etc gag ttc tct cgt 
Gly Val Thr Val Met Ser Thr Pro Leu Ala Phe Leu Glu Phe Ser Arg 
3525 3530 3535 



10608 



cag cgc ggt ttg gcg cca gat ggt egg tgc aag teg ttt gcg gec get 
Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala 
3540 3545 3550 



10656 



gcg gat ggc acc ggg tgg ggt gag ggt gee ggc ctg gtg ttg ctg gag 
Ala Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu 
3555 3560 3565 



10704 



egg ttg teg gat get cgt egg aat ggt cac egg gtg ttg gec gtg gtt 
Arg Leu Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu Ala Val Val 
3570 3575 3580 



10752 



cgc ggg tct gcg gtg aat cag gat ggt gcg teg aat ggc ctg act gcg 
Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala 
3585 3590 3595 3600 



10800 
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ccg aat ggt ccg teg cag cag egg gtg att egg cag gee etc gcg aat 10848 
Pro Asn Gly Pro Ser Gin Gin Arg Val He Arg Gin Ala Leu Ala Asn 
3605 3610 3615 



gcg ggg ctg teg gcg tec gat gtg* gat gtc gtg gag gcg cac ggg ace 10896 
Ala Gly Leu Ser Ala Ser Asp Val Asp Val Val Glu Ala His Gly Thr 
3620 3625 3630 



ggt ace ggg etc ggg gat ccg ate gag gcg cag gcg ctg ate gcg aca 10944 
Gly Thr Gly Leu Gly Asp Pro He Glu Ala Gin Ala Leu lie Ala Thr 
3635 3640 3645 

tat ggg cag gag egg gat cct gag egg gee ctg tgg ctg ggg teg ate 10992 
Tyr Gly Gin Glu Arg Asp Pro Glu Arg Ala Leu Trp Leu Gly Ser He 
3650 3655 3660 



aag tec aac ate ggc cac acg cag gcg gcg gee ggt gtg gcg ggg gtc 11040 
Lys Ser Asn He Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val 
3665 3670 3675 3680 



ate aag atg gtg cag gee atg egg cac ggg gag ttg cct gcg acg ttg 11088 
He Lys Met Val Gin Ala Met Arg His Gly Glu Leu Pro Ala Thr Leu 
3685 3690 3695 

cac gtg gac aag ccc act cca cag gtg gac tgg tct gee ggg gee gtt 11136 
His Val Asp Lys Pro Thr Pro Gin Val Asp Trp Ser Ala Gly Ala Val 
3700 3705 3710 

egg etc etc acc ggg aac acg ccc tgg ccc gag age ggc cgt cct cgt 11184 
Arg Leu Leu Thr Gly Asn Thr Pro Trp Pro Glu Ser Gly Arg Pro Arg 
3715 3720 3725 

cga gcg ggg gtg teg teg ttc ggg ate age. ggc acc aac gca cac etc 11232 
Arg Ala Gly Val Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Leu 
3730 3735 3740 



ate etc gaa caa cca ccg teg gaa cca gcg gag ate gac caa teg gat 11280 
He Leu Glu Gin Pro Pro Ser Glu Pro Ala Glu He Asp Gin Ser Asp 
3745 3750 3755 3760 



egg egg gtc act gcg cat cca gcg gtg ate ccg tgg atg ttg teg get 11328 
Arg Arg Val Thr Ala His Pro Ala Val He Pro Trp Met Leu Ser Ala 
3765 3770 3775 



agg agt etc gca gcg ctg cag gee caa gcg get gcg ctg cag gee egg 11376 
Arg Ser Leu Ala Ala Leu Gin Ala Gin Ala Ala Ala Leu Gin Ala Arg 
3780 3785 3790 
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ctg gac egg ggt cct ggc get tct ccg 
Leu Asp Arg Gly Pro Gly Ala Ser Pro 
3795 3800 

gcg acc act cgt tct gtg ctg gac gaa 
Ala Thr Thr Arg Ser Val Leu Asp Glu 
3810 3815 



ctg gat ttg ggg tat tea etc 11424 
Leu Asp Leu Gly Tyr Ser Leu 
3805 

cgc gee gtc gtg tgg ggt gec 11472 
Arg Ala Val Val Trp Gly Ala 
3820 



gat egg gag gca ctg ctg tec agg ctg gca gcg etc gec gat ggc egg 11520 
Asp Arg Glu Ala Leu Leu Ser Arg Leu Ala Ala Leu Ala Asp Gly Arg 
3825 3830 3835 3840 

acg gcg ccg ggg gtg ata acg ggc tct gcg aat tec ggt ggc cgc ate 11568 
Thr Ala Pro Gly Val lie Thr Gly Ser Ala Asn Ser Gly Gly Arg lie 
3845 3850 3855 

gga ttc gtt ttt tec ggt cag ggc agt cag tgg ctg ggg atg gga aag 11616 
Gly Phe Val Phe Ser Gly Gin Gly Ser Gin Trp Leu Gly Met Gly Lys 
3860 3865 3870 

gcg ttg tgc gcg get ttc ccg gcg ttc gcg gac gee ttc gag gaa gee 11664 
Ala Leu Cys Ala Ala Phe Pro Ala Phe Ala Asp Ala Phe Glu Glu Ala 
3875 3880 3885 

tgc gac gcg eta age gca cac ctg ggc gcg gac gtt egg ggt gtg ctg 11712 
Cys Asp Ala Leu Ser Ala His Leu Gly Ala Asp Val Arg Gly Val Leu 
3890 3895 3900 

ttc ggt get gat "gag cag atg etc gac egg acg ctg tgg gcg cag teg 11760 
Phe Gly Ala Asp Glu Gin Met Leu Asp Arg Thr Leu Trp Ala Gin Ser 
3905 3910 3915 3920 

ggg ate ttc gcg gtt caa gtc ggc etc ctg gga ttg ctg agg teg tgg 11808 
Gly lie Phe Ala Val Gin Val Gly Leu Leu Gly Leu Leu Arg Ser Trp 
3925 3930 3935 



ggc gtg egg ccg gee gcg gtg ctg ggg cac teg gtc ggc gag ttg get 11856 

Gly Val Arg Pro Ala Ala Val Leu Gly His Ser Val Gly Glu Leu Ala 
3940 3945 3950 

gcg gcg cac gcg get ggt gtg ttg tec ttg ccg gac get gca egg ttg 11904 

Ala Ala His Ala Ala Gly Val Leu Ser Leu Pro Asp Ala Ala Arg Leu 
3955 3960 3965 

gtt gcg get egg gee cac ctg atg cag gca ttg ccc acc ggc ggc gca 11952 

Val Ala Ala Arg Ala His Leu Met Gin Ala Leu Pro Thr Gly Gly Ala 
3970 3975 3980 



246 



WO 01/16303 PCT/EPOO/08013 



atg etc gcg gtc gec acc age gag gcg gcg gtc gga ccg ctg ctt tec 12000 
Met Leu Ala Val Ala Thr Ser Glu Ala Ala Val Gly Pro Leu Leu Ser 
3985 3990 3995 4000 

ggg gtg tgc gat egg gtc age ate get gcg ate aac ggc ccc gag teg 1204 8 
Gly Val Cys Asp Arg Val Ser lie Ala Ala He Asn Gly Pro Glu Ser 
4005 4010 4015 



gta gtg etc tec ggc gac cgc gat gtg etc gtg gag etc gca ggc gaa 12096 
Val Val Leu Ser Gly Asp Arg Asp Val Leu Val Glu Leu Ala Gly Glu 
4020 4025 4030 . 

ttc gat gee cga ggg ctt agg acc aaa tgg ttg egg gtc tec cat get 12144 
Phe Asp Ala Arg Gly Leu Arg Thr Lys Trp Leu Arg Val Ser His . Ala 
4035 4040 4045 

ttc cac teg cac egg atg gaa ccg att ctg gac gag tac gcg gaa acc 12192 
Phe His Ser His Arg Met Glu Pro He Leu Asp Glu Tyr Ala Glu Thr 
4050 4055 4060 



gee agg tgc gtc gag ttc ggt gaa ccg gtg gtg ccg ate gtc tec gee 12240 
Ala Arg Cys Val Glu Phe Gly Glu Pro Val Val Pro He Val Ser Ala 
4065 4070 4075 4080 



gcg acc ggt gcg ctg gac acc acc gga ctg atg tgc gcg gec gac tac 12288 
Ala Thr Gly Ala Leu Asp Thr Thr Gly Leu Met Cys Ala Ala Asp Tyr 
4085 4090 - 4095 

tgg acg cgc caa gtg cgt gat cct gtc cgc ttc gga gac ggt gtc egg 12336 
Trp Thr Arg Gin Val Arg Asp Pro Val Arg Phe Gly Asp Gly Val Arg 
4100 4105 4110 



gcg etc gtc ggc caa ggc gtg gac acg ate gtc gag ttc ggc ccg gac 12384 
Ala Leu Val Gly Gin Gly Val Asp Thr He Val Glu Phe Gly Pro Asp 
4115 4120 4125 



ggg gcg ttg teg gee ctg gtc gag cag tgc ttg gee ggg tec gac cag 12432 
Gly Ala Leu Ser Ala Leu Val Glu Gin Cys Leu Ala Gly Ser Asp Gin 
4130 4135 4140 



get ggg agg gtg gcg gcg ate ccg 

Ala Gly Arg Val Ala Ala He Pro 
4145 4150 

gtc gag acc gcg gtg gcg gec ctg 

Val Glu Thr Ala Val Ala Ala Leu 
4165 



ctg atg cgc agg gac cgc gat gag 124 80 
Leu Met Arg Arg Asp Arg Asp Glu 
4155 4160 

gcg cac gtg cac gtc cgc ggt ggt 12528 
Ala His Val His Val Arg Gly Gly 
4170 4175 
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gcg gtg gac tgg teg get tgc ttc gec ggc ace ggc gee cgc acc gtc 
Ala Val Asp Trp Ser Ala Cys Phe Ala Gly Thr Gly Ala Arg Thr Val 
4180 4185 4190 



12576 



gag ttg ccc acc tac gec ttc caa cgc cag egg tac tgg ctg gee ggg 
Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Ala Gly 
4195 4200 4205 



12624 



caa gcg gac ggg cgc ggc ggc gat gtg gtt gee gac ccg gtc gac gcg 
Gin Ala Asp Gly Arg Gly Gly Asp Val Val Ala Asp Pro Val Asp. Ala 
4210 4215 4220 



12672 



cgc ttc tgg gag ttg gtc gag cgc gee gat ccg gaa ccg ttg gtg gat 
Arg Phe Trp Glu Leu Val Glu Arg Ala Asp Pro Glu Pro Leu Val Asp 
4225 4230 4235 4240 



12720 



gaa etc tgc ate gac egg gac cag ccc ttc egg gag gtg ctg ccc gtt 
Glu Leu Cys lie Asp Arg Asp Gin Pro Phe Arg Glu Val Leu Pro Val 
4245 4250 4255 



12768 



ctg get tec tgg cgc gag aaa caa cgc cag gag gee etc gcg gat tec 
Leu Ala Ser Trp Arg Glu Lys Gin Arg Gin Glu Ala Leu Ala Asp Ser 
4260 4265 4270 



12816 



tgg cgc tac cag gtg cgc tgg agg tec gtc gag gtg ccg tec gca gec 
Trp Arg Tyr Gin Val Arg Trp Arg Ser Val Glu Val Pro Ser Ala Ala 
4275 4280 4285 



12864 



gec etc egg ggc gtg tgg ctg gtg gtg ctt cca get gac gtg ccc cga 
Ala Leu Arg. Gly Val Trp Leu Val Val Leu Pro Ala Asp Val Pro Arg 
4290 4295 4300 



12912 



gat caa ccg gcg gtc gtc ate gac gcg ctg ate gcg cgc ggc gee gag 
Asp Gin Pro Ala Val Val He Asp Ala Leu He Ala Arg Gly Ala Glu 
4305 4310 4315 4320 



12960 



gtc gcg gtc ctg gaa ttg acc gag cag gac etc caa cgc agt gcg ctt 
Val Ala Val Leu Glu Leu Thr Glu Gin Asp Leu Gin Arg Ser Ala Leu 
4325 4330 4335 



13008 



gtg gac aag gtg cgc gec gtc att gcg gac cgc acc gag gtg acg ggt 
Val Asp Lys Val Arg Ala Val He Ala Asp Arg Thr Glu Val Thr Gly 
4340 4345 4350 



13056 



gtg ttg tct ctg ttg gcg atg gac ggc atg ccc tgc gcg gcg cat ccg 
Val Leu Ser Leu Leu Ala Met Asp Gly Met Pro Cys Ala Ala His Pro 
4355 4360 4365 



13104 
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cac ctg tec cgt ggt gtc gec get acc gtg ate ctg acg cag gtg ttg 13152 
His Leu Ser Arg Gly Val Ala Ala Thr Val He Leu Thr Gin Val Leu 
4370 4375 4380 

ggc gat gcg ggt gtt tec gee ccg ctg tgg ctg gee acg acc ggt ggc 132 00 

Gly Asp Ala Gly Val Ser Ala Pro Leu Trp Leu Ala Thr Thr Gly Gly 
4385 4390 4395 4400 

gtc gag gec ggg acc gag gac ggt ccg gee gat ccg gac cac ggc ttg 1324 8 

Val Glu Ala Gly Thr Glu Asp Gly Pro Ala Asp Pro Asp His Gly Leu 
4405 4410 4415 

ate tgg ggg etc ggc agg gtc gtc ggc ctt gaa cat ccg cag tgg tgg 13296 

He Trp Gly Leu Gly Arg Val Val Gly Leu Glu His Pro Gin Trp Trp 
4420 4425 4430 



ggt ggc ctg ate gac ctt ccg gag aca ctg gac gag acg tec egg aac 13344 
Gly Gly Leu He Asp Leu Pro Glu Thr Leu. Asp Glu Thr Ser Arg Asn . 
4435 4440 4445 



ggg ttg gtg gee gca etc gec ggg acg gcg gee gaa gat cag etc gee 13392 
Gly Leu Val Ala Ala Leu Ala Gly Thr Ala Ala Glu Asp Gin Leu Ala 
4450 4455 4460 

gtg cgt tea tec ggg ttg ttc gtt cgc aga gtg gtg cgc gca gcg egg 13440 
Val Arg Ser Ser Gly Leu Phe Val Arg Arg Val Val Arg Ala Ala Arg 
4465 4470 4475 4480 



aac ccc egg tea gag aca tgg cgt age egg gga acg gtc etc ate acg 13488 
Asn Pro Arg Ser Glu Thr Trp Arg Ser Arg Gly Thr Val Leu He Thr 
4485 4490 4495 



ggc gga aca ggc gcg etc ggt gee gag gtc gca cga tgg ctg gee egg 13536 
Gly Gly Thr Gly Ma Leu Gly Ala Glu Val Ala Arg Trp Leu Ala Arg 
4500 4505 4510 

egg gga get gag cac ctg gtg ttg ate agt cgc cgc ggc ccg gaa get 13584 
Arg Gly Ala Glu His Leu Val Leu He Ser Arg Arg Gly Pro Glu Ala 
4515 4520 4525 



ccc ggc gca gcg gac eta ggg gec gag ctg act gaa etc ggc gtg aaa 13632 
Pro Gly Ala Ala Asp Leu Gly Ala Glu Leu Thr Glu Leu Gly Val Lys 
4530 4535 " 4540 



gtc aca gtc ttg gee tgc gat gtg acg gac cgc gac gag ctg gcg gcg 13680 
Val Thr Val Leu Ala Cys Asp Val Thr Asp Arg Asp Glu Leu Ala Ala 
4545 4550 4555 4560 
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gtg ctg gcg gcc gtt ccc acg gag tat ccg ctg teg gcg gtc gtg cac 13728 

Val Leu Ala Ala Val Pro Thr Glu Tyr Pro Leu Ser Ala Val Val His 
4565 4570 4575 

acc gcc ggc gtc ggg acg cct gcg aac ctg gcc gag acg acc ttg gcg 13776 

Thr Ala Gly Val Gly Thr Pro Ala Asn Leu Ala Glu Thr Thr Leu Ala 
4580 45B5 4590 



cag ttc gcc gac gtg ttg teg gcc aag gtc gtc ggc gcg gcg aac ctg 
Gin Phe Ala Asp Val Leu Ser Ala Lys Val Val Gly Ala Ala Asn Leu 
4595 4600 4605 



13824 



gac egg ctg ctt ggc ggg caa ccg ttg gac gcc ttc gtg ctg ttc tec 
Asp Arg Leu Leu Gly Gly Gin Pro Leu Asp Ala Phe Val Leu Phe Ser 
4610 4615 4620 



13872 



teg ate teg gga gtt tgg gga gcc ggc ggc caa gga gcc tat teg gcc 
Ser lie Ser Gly Val Trp Gly Ala Gly Gly Gin Gly Ala Tyr Ser Ala 
4625 4630 4635 4640 



13920 



gcc aat gcg tat etc gat gcc ctt gcc gag cgc cga egg get tgc ggg 
Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Arg Arg Arg Ala Cys Gly 
4645 4650 4655 



13968 



egg ccg gcg acg tgc ate gcc tgg ggt ccg tgg gcg ggt gcg ggc atg 
Arg Pro Ala Thr Cys He Ala Trp Gly Pro Trp Ala Gly Ala Gly Met 
4660 4665 4670 



14016 



gcc gtt cag gaa ggt aac gag gcg cat etc cgc cga agg ggc ctg gta 
Ala Val Gin Glu Gly Asn Glu Ala His Leu Arg Arg Arg Gly Leu Val 
4675 4680 4685 



14064 



ccg atg gaa ccg cag teg gcc etc ttc gcg ctg caa cag gcc ctg tec 
Pro Met Glu Pro Gin Ser Ala Leu Phe Ala Leu Gin Gin Ala Leu Ser 
4690 4695 4700 



14112 



caa cga gaa acc gcc ate acc gtc gca gat gtg gac tgg gag cga ttc 
Gin Arg Glu Thr Ala He Thr Val Ala Asp Val Asp Trp Glu Arg Phe 
4705 4710 4715 ' 4.720 



14160 



gcc gcc tct ttc acc gcg gcc. cgc ccg cga cca ctg ttg gaa gag ate 
Ala Ala Ser Phe Thr Ala Ala Arg Pro Arg Pro Leu Leu Glu Glu He 
4725 4730 4735 



14208 



gtg gat eta egg ccc gac acc gag acc gag gag aag cac ggt gcc ggc 
Val Asp Leu Arg Pro Asp Thr Glu Thr Glu Glu Lys His Gly Ala Gly 
4740 4745 4750 



14256 
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gag ctg ggg cag cag ctg gcc gca ctg ccg ccc get gag cgc gga cac 14304 
Glu Leu Gly Gin Gin Leu Ala Ala Leu Pro Pro Ala Glu Arg Gly His 
4755 4760 4765 

ctg ctg ctg gag gtg gtg ctg gcg gaa acc gcc age acc ctg ggg cac 14352 
Leu Leu Leu Glu Val Val Leu Ala Glu Thr Ala Ser Thr Leu Gly His 
4770 4775 4780 



gat teg gcg gag get gtg caa ccc gat egg acc ttc gcc gaa ctg ggc 14400 
Asp Ser Ala Glu Ala Val Gin Pro Asp Arg Thr Phe Ala Glu Leu Gly 
4785 4790 4795 4800 

ttc gat teg ctg acc gcg gta gag ctg cgc aac agg ttg aac gcg gtg 1444 8 
Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Asn Ala Val 
'4805 4810 4815 



acc ggg ctt cgc ctg ccg ccg acg ctg gtt ttc gac cac ccg acg ccg 14496 
Thr Gly Leu Arg Leu Pro Pro Thr Leu Val Phe Asp His Pro Thr Pro 
4820 4825 4630 



ctg gcg ttg tec gaa cag ttg gtt ccg 
Leu Ala Leu Ser Glu Gin Leu Val Pro 
4835 4840 

aac ggc ate gaa teg ctg etc gcc gag 
Asn Gly lie Glu Ser Leu Leu Ala Glu 
4850 4855 

ttg gcg caa ggg cct teg ate cca ctg 
Leu Ala Gin- Gly Pro Ser lie Pro Leu 
4865 4870 



gcc ctg gtc gcg gag ccg gac 14544 
Ala Leu Val Ala Glu Pro Asp 
4845 

etc gac agg ctg gat acc acg 14592 
Leu Asp Arg Leu Asp Thr Thr 
4860 

gaa gac cag gcc aag gtg gcg 14640 
Glu Asp Gin Ala Lys Val Ala 
4875 4880 



gag cgc ttg cac gca etc etc gcc aag tgg gac ggg gcg cgt gac ggc 14688 
Glu Arg Leu His Ala Leu Leu Ala Lys Trp Asp Gly Ala Arg Asp Gly 
4885 4890 4895 

acg gcc aga gcg acg tea ccc caa teg ctg acg gcg gcc acg gac gac 14736 
Thr Ala Arg Ala Thr Ser Pro Gin Ser Leu Thr Ala Ala Thr Asp Asp 
4900 . 4905 4910 

gaa ate ttc gac etc ate gac egg aag ttc egg cgc tga 147 75 

Glu He Phe Asp Leu He Asp Arg Lys Phe Arg Arg 
4915 4920 



<210> 48 
<211> 4924 
<212> PRT 
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<213> Saccharopolyspora spinosa 
<400> 48 

Met Ala Asn Glu Glu Lys Leu Phe Gly Tyr Leu Lys Lys Val Thr Ala 
15 10 v 15 

Asp Leu His Gin Thr Arg Gin Arg Leu Leu Ala Ala Glu Ser Arg Ser 
20 25 30 

Gin Glu Pro lie Ala He Val Ser Ala Ser Cys Arg Leu Pro Gly Gly 
35 40 ' 45 

Val Asp Ser Pro Glu Ala Leu Trp Gin Leu Val Arg Thr Gly Thr Asp 
50 55 60 

Ala He Ser Glu Phe Pro Ala Asp Arg Gly Trp Asp Leu Gly Arg Leu 
65 70 75 80 

Tyr Asp Pro Asp Pro Asn His Gin Gly Thr Ser Tyr Thr Arg Ala Gly 
85 90 95 

Gly Phe Leu Ala Gly Ala Gly Asp Phe Asp Pro Ala Met Phe Gly He 
100 105 110 

Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

Glu Leu Ser Trp Glu Ala Leu Glu Arg Ala Gly He Asp Pro Thr Ser 
130 135 140 

Leu Arg Gly Ser Lys Thr Gly Val Phe Gly Gly Val Thr Pro Gin Glu 
145 150 155 160 

Tyr Gly Pro Ser Leu Gin Glu Met Ser Arg Asn Ala Gly Gly Phe Gly 
165 170 175 

Leu Thr Gly Arg Met Val Ser Val Ala Ser Gly Arg Val Ala Tyr Ser 
180 185 190 

Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser 
195 200 • 205 

Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Ser Gly Glu 
210 215 220 

Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Ala 
225 230 235 240 
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Thr Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 250 255 

Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 

Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 
275 280 285 

His Glu Val Leu Ala Val Val Arg Gly. Ser Ala Val Asn Gin Asp Gly 
290 295 300 

Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg Val 
305 310 315 320 

lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser Val Ser Asp Val Asp 
325 330 335 

Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro He Glu 
340 345 350 

Ala Gin Ala Leu He Ala Thr Tyr Gly Gin Gly Arg Glu Lys Asp Arg 
355 360 365 

Pro Leu Trp Leu Gly Ser Val Lys Ser Asn He Gly His Thr Gin Ala 
370 375 380 

Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Met Arg His 
385 390 395 400 

Gly Gin Leu Pro Ala Thr Leu His Val Asp Glu Pro Thr Ser Ala Val 
405 410 415 

Asp Trp Ser Ala Gly Ser Val Arg Leu Leu Thr Glu Asn Thr Pro Trp 
420 425 430 

Pro Asp Ser Gly Arg Pro Cys Arg Val Gly Val Ser Ser Phe Gly He 
435 440 445 

Ser Gly Thr Asn Ala His Val He Leu Glu Gin Ser Pro Val Glu Gin 
450 455 460 

Gly Glu Pro Ala Gly Pro Val Glu Gly Glu Arg Glu Pro Asp Val Ala 
465 470 475 480 

Val Pro Val Val Pro Trp Val Leu Ser Gly Lys Thr Pro Glu Ala Ala 
485 490 495 
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Arg Ala Gin Ala Glu Arg Val His Ser His He Glu Asp Arg Pro Gly 
500 505 510 

Leu Ser Pro Val Asp Val Ala Tyr Ser Leu Gly Met Thr Arg Ala Ala 
515 520 525 

Leu Asp Glu Arg Ala Val Val Leu Gly Ser Asp Arg Ala Ala Leu Leu 
530 535 540 

Thr Gly Leu Arg Ala Phe Ala Asp Gly Cys Asp Ala Pro Glu Val Val 
545 550 555 560 

Ser Gly Ser Val Gly Leu Gly Gly Arg Val Gly Phe Val Phe Ser Gly 
565 570 575 

Gin Gly Gly Gin Trp Pro Gly Met Gly Arg Gly Leu Tyr Ser Val Phe 
580 585 590 

Pro Val Phe Ala Asp Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp Ala 
595 600 605 

His Leu Gly Gin Glu Leu Arg Val Arg Asp Val Val Phe Gly Ser Gin 
610 615 620 

Ala Trp Leu Leu Asp Arg Thr Val Trp Ala Gin Ser Gly Leu Phe Ala 
625 630 635 640 

Leu Gin He Gly Leu Leu Arg Leu Leu Gly Ser Trp Gly Val Arg Pro 
645 650 655 

Asp Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Val His Ala 
660 665 670 

Ala Gly Val Leu Ser Leu Ser Glu Ala Ala Arg Leu Val Ala Gly Arg 
675 680 685 

Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala Met Leu Ala Val 
690 695 700 

Ala Thr Gly Glu Phe Gin Val Asp Pro Leu Leu Asp Gly Val Arg Asp 
705 710 715 720 

Arg He Gly He Ala Ala Val Asn Gly Pro Glu Ser Val Val Leu Ser 
725 730 735 

Gly Asp Arg Glu Leu Leu Thr Glu He Ala Asp Arg Leu His Asp Gin 
740 745 750 
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Gly Cys Arg Thr Arg Trp Leu Arg Val Ser His Ala Phe.His Ser Pro 
755 760 765 

His Met Glu Pro Met Leu Glu Glu Phe Ala Gin lie Ser Arg Gly Arg 
770 775 780 

Glu Tyr His Ala Pro Glu Leu Pro lie lie Ser Thr Leu He Gly Glu 
785 790 795 800 

Leu Asp Gly Gly Arg Val Met Gly Thr Pro Glu Tyr Trp Val Arg Gin 
805 810 815 

Val Arg Glu Pro Val Arg Phe Ala Glu Gly Val Gin Ala Leu Val Gly 
820 825 830 

Gin Gly Val Gly Thr He Val Glu Leu Gly Pro Asp Gly Ala Leu Ser 
835 840 845 

Thr Leu Val Glu Glu Cys Val Ala Glu Ser Gly Arg Val Ala Gly He 
850 855 860 

Pro Leu Met Arg Lys Asp Arg Asp Glu Ala Arg Thr Val Leu Ala Ala 
865 870 875 880 

Leu Ala Gin He His Thr Arg Gly Gly Glu Val Asp Trp Arg Ser Phe 
885 890 895 

Phe Ala Gly Thr Gly Ala Lys Gin Val Asp Leu Pro Thr Tyr Ala Phe 
900 905 910 

Gin Arg Gin Arg Tyr Trp Leu Ala Ser Thr Gly Arg Ala Gly Asp Val 
915 920 925 

Thr Ala Ala Gly Leu Ala Glu Ala Asp His Pro Leu Leu Gly Ala Val 
930 935 940 

Val Ala Leu Ala Asp Gly Glu Gly Val Val Leu Thr Gly Arg Leu Thr 
945 950 955 960 

Ala Gly Ser His Pro Trp Leu Ser Asp His Arg Val Leu Gly Glu He 
965 970 975 

Val Val Pro Gly Thr Ala He Val Glu Leu Val Trp His Val Gly Glu 
980 985 990 

Arg Leu Gly Cys Gly Arg Val Glu Glu Leu Ala Leu Glu Ala Pro Leu ' 
995 1000 1005 
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lie Leu Pro Asp His Gly Ala Val Gin Val Gin Val Leu Val Gly Pro 
1010 1015 1020 

Pro Gly Glu Ser Gly Ala Arg Ser Val Ala Leu Tyr Ser Cys Pro Gly 

025 1030 1035 1040 ' 

Glu Ala lie Glu Pro Glu Trp Lys Lys His Ala Thr Gly Val Leu Leu 
1045 1050 1055 

Pro Pro Val Ala Ala Glu Asn His Glu Leu Thr Ala Trp Pro Pro Glu 
1060 1065 1070 

Asn Ala Thr Glu lie Asp Ala Asp Gly Val Tyr Ala Phe Leu Glu Gly 
1075 1080 1085 

His Gly Phe Ala Tyr Gly Pro Ala Phe Arg Cys Leu Arg Gly Ala Trp 
1090 1095 1100 

Arg Arg Gly Gly Glu Val Phe Ala Glu Val Ala Leu Pro Asp Asp Met 
105 1110 1115 1120 

Gin Ala Gly Val Asp Arg Phe Gly Val His Pro Ala Leu Leu Asp Ala 
1125 1130 1135 

Val Leu His Ala Ala Ala Ala Glu Thr Ser Val Val Gin Ser Glu Ala 
1140 1145 1150 

Arg Val Pro Phe Ser Trp Arg Gly Val Glu Leu Arg Ala Thr Glu Ser 
1155 1160 1165 

Ala Val Val Arg Ala Arg Leu Ser Leu Thr Ser Asp Asp Glu Leu Ser . 
1170 1175 1180 

Leu Val Ala Val Asp Pro Ala Gly Arg Phe Val Ala Thr Val Asp Ser 
185 1190 1195 1200 

Leu Val Thr Arg Pro lie Ser Arg Gin Gin Val Arg Ser Gly Ala lie 
1205 1210 1215 

Gly Asp Cys Leu Phe Glu Val Glu Trp His Arg Lys Ala Leu Leu Gly 
1220 1225 1230 

Thr Thr Ala Gly Asp Asp Leu Ala lie Val Gly Asp Gly Pro Ser Trp 
1235 1240 1245 

Pro Glu Ser Val Arg Ala Thr Ala Arg Phe Ala Thr Leu Asp Glu Phe 
1250 1255 1260 
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Arg Ala Ala Val Asp Ser Asp Val Pro Ala Pro Gly Ser Val Leu Val 
265 1270 1275 1280 

Ala Ala Met Ser Ala Glu Glu Val Glu Gly Gly Ser Leu Pro Ser Arg 
• 1285 1290 1295 

Ala Gin Glu Ser Thr Ser Asp Leu Leu Ala Leu Val Gin Ser Trp Leu 
1300 1305 1310 

Ala Asp Glu Arg Phe Ala Glu Ser Gin Leu Val Val Val Thr Arg Ala 
1315 1320 1325 

Ala Val Ser Ala Asp Ser Asp Ser Asp Val Ala Asp Leu Val Gly Ala 
1330 1335 1340 

Ser Ser Trp Gly Leu Leu Ser Ser Ala Gin Ser Glu Asn Pro Gly Arg 
345 1350 1355 1360 

Phe Val Leu Val Asp Val Asp Gly Thr Pro Glu Ser Trp Gin Ala Leu 
1365 1370 1375 

Pro Ala Ala Val Arg Ala Gly Glu Pro Gin Leu Ala Leu Arg Arg Gly 
1380 1385 1390 

Val Ala Leu Val Pro Arg Leu Ala Arg Leu Thr Val Arg Glu Glu Gly 
1395 1400 1405 

Ser Ser Pro Gin Leu Asp Thr Asp Gly Thr Val Leu lie Thr Gly Gly 
1410 1415 1420 

Thr Gly Ala Leu Gly Gly Val Val Ala Arg His Leu Val Glu Glu His 
425 1430 1435 1440 

Gly He Arg Arg Leu Val Leu Ala Gly Arg Arg Gly Trp Asn Ala Pro 
1445 1450 1455 

Gly Val His Glu Leu Val Asp Glu Leu Ala Arg Ala Gly Ala Val Val 
1460 1465 1470 

Glu Val Val Ala Cys Asp Val Ala Asp Arg Thr Asp Leu Glu His Val 
1475 1480 1485 

Leu Ala Ala He Pro Val Asp Trp Pro Leu Arg Gly He Val His Thr 
1490 1495 1500 

Ala Gly Val Leu Ala Asp Gly Val He Gly Ser Leu Ser Ala Ala Asp 
505 1510 1515 1520 
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Val Gly Thr Val Phe Ala Pro Lys Val Thr Gly Ala Trp His Leu His 
1525 1530 1535 

Glu Leu Thr Arg Asp Leu Asp Leu Ser Phe Phe Val Leu Phe Ser Ser 
1540 1545 1550 

Phe Ser Gly lie Ala Gly Ala Ala Gly Gin Ala Asn Tyr Ala Ala Ala 
1555 1560 1565 

Asn Thr Phe Leu Asp Ala Leu Ala Arg Tyr Arg Arg Ala Arg Gly Leu 
1570 1575 1580 

Pro Gly Leu Ser Leu Ala Trp Gly Leu Trp Ala Gin Pro Ser Gly Met 
.585 1590 1595 1600 

Thr Ser Gly Leu Asp Ala Ala Ser Val Glu Arg Leu Ala Arg Thr Gly 
1605 1610 1615 

He Ala Glu Leu Ser Thr Glu Asp Gly Leu Arg Leu Phe Asp Ala Ala 
1620 1625 1630 

Phe Ala Lys Asp Arg Ala Cys Val Val Ala Ala Arg Leu Asp Arg Ala 
1635 1640 1645 

Leu Leu Val Gly Asn Gly Arg Ser His Ala He Pro Ala Leu Leu Ser 
1650 1655 1660 

Ala Leu Val Pro Val Arg Gly Gly Val Ala Arg Lys Thr Ala Asn Ser 
665 1670 1675 1680 

Gin Ala Ala Asp Glu Asp Ala Leu Leu Gly Leu Val Arg Glu His Val 
16.85 1690 1695 

Ser Ala Val Leu Gly Tyr Ser Gly Ala Val Glu Val Gly Gly Asp Arg 
1700 1705 1710 

Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly Val Glu Leu Arg 
1715 1720 1725 

Asn Arg Leu Ala Gly Val Leu Gly Val Arg Leu Pro Ala Thr Ala Val 
1730 1735 1740 

Phe Asp Tyr Pro Thr Pro Arg Ala Leu Ala Arg Phe Leu His Gin Glu 
745 1750 1755 1760 

Leu Ala Gly Glu Val Ala Ser Thr Ser Thr Pro Val Thr Arg Ala Ala 
1765 1770 1775 
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Ser Ala Glu Glu Asp Leu .Val Ala lie Val Gly Met Gly Cys. Arg Phe 
1780 1785 1790 

Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala Gly 
1795 1800 1805 

Gly Val Asp Ala Val Ala Gly Phe Pro Asp Asp Arg Gly Trp Asp Leu 
1810 1815 1820 

Ala Ala Leu Tyr Asp Pro Asp Pro Asp Arg Leu Gly Thr Ser Tyr Val 
825 1830 1835 1840 

Cys Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala Asp Met 
1845 1850 1855 

Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
I860 1865 1870 

Leu Leu Leu Glu Val Ala Trp Glu Thr Leu Glu Arg Ala Gly lie Asp 
1875 1880 1885 

Pro Phe Ser Leu His Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met 
1890 1895 1900 

Tyr His Asp Tyr Gly Ala Arg Phe He Thr Arg Ala Pro Glu Gly Phe 
905 1910 1915 1920 

Glu Gly His Leu Gly Thr Gly Asn Ala Gly Ser Val Leu Ser Gly Arg 
1925 1930 1935 

Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr 
1940 1945 1950 

Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Gly Gin Ala Leu 
1955 1960 1965 

Arg Ala Gly Glu Cys Glu Phe Ala Leu Ala Gly Gly Val Thr Val Met 
1970 1975 1980 

Ser Thr Pro Thr Thr Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala 
985 1990 1995 2000 

Pro Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly 
2005 2010 2015 

Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala 
2020 2025 2030 
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Arg Arg Asn Gly His Glu Val Leu Ala Val Val Arg Gly Ser Ala Val 
2035 2040 2045 

Asn Gin Asp Gly Ala Ser Asn : Gly Leu Thr Ala Pro Asn Gly Pro Ser 
2050 2055 2060 

Gin Gin Arg Val lie Thr Gin Ala Leu Thr Ser Ala Gly Leu Ser Val 
065 2070 2075 2080 

Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
2085 2090 2095 

Asp Pro lie Glu Ala Gin Ala Leu lie Ala Thr Tyr Gly Arg Asp Arg 
2100 2105 2110 

Asp Pro Gly Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn lie Gly 
2115 2120 2125 

His Thr Gin Ala Ala Ala Gly Val Ala Gly Val lie Lys Met Val Met 
2130 2135 2140 

Ala Met Arg Gin Gly Glu Leu Pro Arg Thr Leu His Val Asp Glu Pro 
145 2150 2155 2160 

Ser Ala Gin Val Asp Trp Ser Ala Gly Thr Val Gin Leu Leu Thr Glu 
2165 2170 2175 

Asn Thr Pro Trp Pro Asp Ser Gly Arg Leu Arg Arg Ala Gly Val Ser 
2180 2185 2190 

Ser Phe Gly lie Ser Gly Thr Asn Ala His Leu lie Leu Glu Gin Pro 
2195 2200 2205 

Pro Arg Glu Ser Gin Arg Ser Thr Glu Pro Asp Ser Gly Ser Val Arg 
2210 2215 2220 

Asp Phe Pro Val Val Pro Trp Met Val Ser Gly Lys Thr Pro Glu Ala 
225 2230 2235 2240 

Leu Ser Ala Gin Ala Asp Ala Leu Met Ser Tyr Leu Ser Asn Arg Val 
2245 2250 2255 

Asp Ala Ser Pro Arg Asp lie Gly Tyr Ser Leu Ala Val Thr Arg Pro 
2260 2265 2270 

Ala Leu Asp His Arg Ala Val Val Leu Gly Ala Asp Arg Ala Ala Leu 
2275 2280 2285 
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Leu Pro Gly Leu Lys Ala Leu Ala Val Ser Asn Asp Ala Ala Glu Val 
2290 2295 2300 

He Thr Gly Thr Arg Ala Ala Gly Pro Val Gly Phe Val Phe Ser Gly 
305 2310 2315 2320 

Gin Gly Gly Gin Trp Pro Gly Met Gly Ser Gly Leu His Ser Ala Phe 
2325 2330 2335 

Pro Val Phe Ala Asp Ala Phe Asp Glu Ala Cys Cys Glu Leu Asp Ala 
2340 . 2345 2350 

His Leu Gly Gin Met Ala Arg Leu Arg Asp Val Leu Ser Gly Ser Asp 
2355 2360 2365 

Thr Gin Leu Leu Asp Gin Thr Leu Trp Ala -Gin Pro Gly Leu Phe Ala 
2370 2375 2380 

Leu Gin Val Gly Leu Trp Glu Leu Leu Gly Ser Trp Gly Val Arg Pro 
385 2390 2395 2400 

Ala Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala Phe Ala 
2405 2410 2415 

Ala Gly Val Leu Ser Leu Arg Asp Ala Ala Arg Leu Val Ala Gly Arg 
2420 2425 2430 

Ala Arg Leu Met Gin Ala Leu Pro Thr Gly Gly Ala Met Leu Ala Ala 
2435 2440 2445 

Ala Ala Gly Glu Glu Gin Leu Arg Pro Leu Leu Ala Asp Cys Gly Asp 
2450 2455 2460 

Arg Val Gly He Ala Ala Val Asn Ala Pro Gly Ser Val Val Leu Ser 
465 2470 2475 2480 

Gly Asp Arg Asp Val Leu Asp Asp He Ala Gly Arg Leu Asp Gly Gin 
2485 2490 2495 

Gly He Arg Ser Arg Trp Leu Arg Val Ser His Ala Phe His Ser His 
2500 2505 2510 

Arg Met Asp Pro Met Leu Ala Glu Phe Thr Glu He Ala Arg Ser Val 
2515 2520 2525 

Asp Tyr Arg Ser Ser Gly Leu Pro He Val Ser Thr Leu Thr Gly Glu 
2530 2535 2540 
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Leu Asp Glu Val Gly Met Pro Ala Thr Pro Glu Tyr Trp Val Arg Gin 
545 2550 2555 2560 

Val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Ala Ala Leu Ala Ala ■. 

2565 2570 2575 

His Gly Val Ser Thr Val Val Glu Val Gly Pro Asp Gly Val Leu Ser 
2580 2585 2590 

Ala Leu Val Gin Glu Cys Ala Ala Gly Ser Asp Gin Gly Gly Arg Val 
2595 2600 2605 

Ala Ala Val Pro Leu Met Arg Ser Asn Arg Asp Glu Ala His Thr Val 
2610 2615 2620 

Thr Thr Ala Leu Ala Gin lie His Val Arg Gly Ala Glu Val Asp Trp 
625 2630 2635 2640 

Arg Ser Phe Phe Ala Gly Thr Gly Ala Lys Gin Val Glu Leu Pro Thr 
2645 2650 2655 

Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Asp Ser Pro Ser Glu Pro 
2660 2665 2670 

Val Gly Gin Ser Ala Asp Pro Ala Arg Gin Ser Gly Phe Trp Glu Leu 
2675 2680 2685 

Val Glu Gin Glu Asp Val Ser Ala Leu Ser Ala Ala Leu His lie Thr 
2690 2695 2700 

Gly Asp His Asp Val Gin Ala Ser Leu Glu Ser Val Val Pro Val Leu 
705 2710 2715 2720 

Ser Ser Trp His Arg Arg lie Arg Asn Glu Ser Leu Val His Gin Trp 
2725 2730 2735 

Arg Tyr Arg lie Ser Trp His Glu Arg Ala Asp Leu Pro Asp Pro Ser 
2740 2745 2750 

Leu Ser Gly Thr Trp Leu Val Val Val Pro Glu Gly Trp Ser Ala Ser 
2755 2760 2765 

Arg Gin Val Leu Arg Phe Asn Glu Met Phe Glu Glu Arg Gly Cys Pro 
2770 2775 2780 

Ala Val Leu Phe Glu Leu Ala Gly His Asp Glu Glu Ala Leu Ala Gin 
785 2790 2795 2800 
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Arg Phe Arg Ser Leu Pro Val Ala Ser Gly Gly lie Ser Gly Val Leu 
2805 2810 2815 



Ser Leu Leu Ala Leu Asp Glu Ser Pro Ser Ser Pro Asn Ala Ala Leu 
2820 2625 2830 

Pro Asn Gly Ala Leu Asn Ser Leu Val Leu Leu Arg Ala Leu Arg Ala 
2835 2840 2845 

Ala Asp Val Ser Ala Pro Leu Trp Leu Ala Thr Cys Gly Gly Val Ala 
2850 2655 2860 

Val Gly Asp Val Pro Val Asn Pro Gly Gin Ala Leu Val Trp Gly Leu 
865 2870 2875 2880 

Gly Arg Val Val Gly Leu Glu His Pro Ala Trp Trp Gly Gly Leu Val 
2885 2890 2895 

Asp Val Pro Cys Leu Leu Asp Glu Asp Ala Arg Glu Arg Leu Ser Val 
2900 2905 2910 

Val Leu Ala Gly Leu Gly Glu Asp Glu lie Ala Val Arg Pro Gly Gly 
2915 2920 2925 

Val Phe Val Arg Arg Leu Glu Arg Ala Gly Ala Ala Ser Gly Ala Gly 
2930- 2935 2940 

Ser Val Trp Arg Pro Arg Gly Thr Val Leu Val Thr Gly Gly Thr Gly 
945 2950 2955 2960 

Gly Leu Gly Ala His Val Ala Arg Trp Leu Ala Gly Ala Gly Ala Glu 
2965 2970 2975 

His Val Val Leu Thr Ser Arg Arg Gly Ala Ala Ala Pro Gly Ala Gly 
2980 2985 2990 

Asp Leu Arg Ala Glu Leu Glu Ala Leu Gly Ala Arg Val Ser lie Thr 
2995 3000 3005 

Ala Cys Asp Val Ala Asp Arg Asp Ala Leu Ala Glu Val Leu Ala Thr 
3010 3015 3020 

lie Pro Asp Asp Cys Pro Leu Thr Ala Val Met His Ala Ala Gly Val 
025 3030 3035 3040 

Val Glu Val Gly Asp Val Ala Ser Met Cys Leu Thr Asp Phe Val Gly 
3045 3050 3055 



263 



WO 01/16303 PCT/EPOO/08013 

Val Leu Ser Ala Lys Ala Gly Gly Ala Ala Asn Leu Asp Glu Leu Leu 
3060 3065 3070 

Ala Asp Val Glu Leu' Asp Ala Phe Val Leu Phe Ser Ser Val Ser Gly 
3075 3080 3085 

Val Trp Gly Ala Gly Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Tyr 
3090 3095 3100 

Leu Asp Ala Leu Ala Gin Gin Arg Arg Ala Arg Gly Leu Val Gly Thr 
105 3110 3115 3120 

Ala Val Ala Trp Gly Pro Trp Ala Gly Asp Gly Met Ala Ala Gly Glu 
3125 3130 3135 

Gly Gly Ala Gin Leu Arg Arg Ala Gly Leu Val Pro Met Ala Ala Asp 
3140 3145 3150 

Arg Ala Leu Leu Ala Leu Gin Gly Ala Leu Asp Arg Asp Glu Thr Ser 
3155 3160 3165 

Leu Val Val Ala Asp Met Ala Trp Glu Arg Phe Ala Pro Val Phe Ala 
3170 3175 3180 

Met Ser Arg Arg Arg Pro Leu Leu Asp Glu Leu Pro Glu Ala Gin Gin 
185 3190 3195 3200 

Ala Leu Ala Asp Ala Glu Asn Thr Thr Asp Ala Ala Asp Ser Ala Val 
3205 3210 3215 

Pro Leu Pro Arg Leu Ala Gly Met Ala Ala Ala Glu Arg Arg Arg Ala 
3220 3225 3230 

Met Leu Asp Leu Val Leu Ala Glu Ala Ser lie Val Leu Gly His Asn 
3235 3240 3245 

Gly Ser Asp Pro Val Gly Pro Asp Arg Ala Phe Gin Glu Leu Gly Phe 
3250 3255 3260 

Asp Ser Leu Met Ala Val Glu Leu Arg Asn Arg Leu Gly Glu Ala Thr 
265 3270 3275 3280 

Gly Leu Ser Leu Pro Ala Thr Leu lie Phe Asp Tyr Pro Ser Pro Ser 
3285 3290 3295 

Ala Leu Ala Glu Gin Leu Val Gly Glu Leu Val Gly Ala Gin Pro Ala 
3300 3305 3310 
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Thr Thr Val Val Ala Gly Ala Asp Pro Val Asp Asp Pro Val Val Val 
3315 3320 3325 

Val Ala Met Gly Cys Arg Tyr Pro Gly Asp Val Cys. Ser Pro Glu Glu 
3330 3335 3340 

Leu Trp Gin Leu Val Ser Ala Gly Arg Asp Ala Val Ser Thr Phe Pro 
345 3350 3355 3360 

Val Asp Arg Gly Trp Asp Cys Asn Thr Leu Phe Asp Pro Asp Pro Asp 
3365 3370 3375 

Arg Ala Gly Ser Thr Tyr Val Arg Glu Gly Ala Phe Leu Thr Gly Ala 
3380 3385 3390 

Asp Arg Phe Asp Ala Gly Phe Phe Gly lie Ser Pro Arg Glu Ala Arg 
3395 3400 3405 

Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Val Ala Trp Glu Val 
3410 3415 3420 

Phe Glu Arg Ala Gly lie Ala Pro Leu Ser Leu Arg Gly Ser Arg Thr 
425 3430 3435 3440 

Gly Val Phe Ala Gly Thr Asn Gly Gin Asp His Gly Ala Lys Val Ala 
3445 3450 3455 

Ala Ala Pro Glu Ala Ala Gly His Leu Leu Thr Gly Asn Ala Ala Ser 
3460 3465 3470 

Val Leu Ala Gly Arg Leu Ser Tyr Thr Phe Gly Leu Glu Gly Pro Ala 
3475 3480 3485 

Val Ala Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu 
3490 3495 3500 

Ala Cys Gin Ser Leu Arg Ser Gly Glu Cys Asp Met Ala Leu Ala Gly 
505 3510 3515 3520 

Gly Val Thr Val Met Ser Thr Pro Leu Ala Phe Leu Glu Phe Ser Arg 
3525 3530 3535 

Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala 
3540 3545 3550 

Ala Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu 
3555 3560 3565 
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Arg Leu Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu Ala Val Val 
3570 3575 3580 

Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala 
585 3590 3595 3600 

Pro Asn Gly Pro Ser Gin Gin Arg Val He Arg Gin Ala Leu Ala Asn 
3605 3610 3615 

Ala Gly Leu Ser Ala Ser Asp Val Asp Val Val Glu Ala His Gly Thr 
3620 3625 3630 

Gly Thr Gly Leu Gly Asp Pro He Glu Ala Gin Ala Leu He Ala Thr 
3635 3640 3645 

Tyr Gly Gin Glu Arg Asp Pro Glu Arg Ala Leu Trp Leu Gly Ser He 
3650 3655 3660 

Lys Ser Asn lie Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val 
665 3670 3675 3680 

He Lys Met Val Gin Ala Met Arg His Gly Glu Leu Pro Ala Thr Leu 
3685 3690 3695 

His Val Asp Lys Pro Thr Pro Gin Val Asp Trp Ser Ala Gly Ala Val 
3700 3705 3710 

Arg Leu Leu Thr Gly Asn Thr Pro Trp Pro Glu Ser Gly Arg Pro Arg 
3715 3720 3725 

Arg Ala Gly Val Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Leu 
3730 3735 3740 

He Leu Glu Gin Pro Pro Ser Glu Pro Ala Glu He Asp Gin Ser Asp 
745 3750 3755 3760 

Arg Arg Val Thr Ala His Pro Ala Val He Pro Trp Met Leu Ser Ala 
3765 3770 3775 

Arg Ser Leu Ala Ala Leu Gin Ala Gin Ala Ala Ala Leu Gin Ala Arg 
3780 3785 3790 

Leu Asp Arg Gly Pro Gly Ala Ser Pro Leu Asp Leu Gly Tyr Ser Leu 
3795 3800 3805 

Ala Thr Thr Arg Ser Val Leu Asp Glu Arg Ala Val Val Trp Gly Ala 
3810 3815 3820 
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Asp Arg Glu Ala Leu Leu Ser Arg Leu Ala Ala Leu Ala Asp Gly Arg 
625 3830 3835 3840 

Thr Ala Pro Gly Va) He Thr Gly Ser Ala Asn Ser Gly Gly Arg He 
3845 3850 3855 

Gly Phe Val Phe Ser Gly Gin Gly Ser Gin Trp Leu Gly Met Gly Lys 
3860 3865 3870 

Ala Leu Cys Ala Ala Phe Pro Ala Phe Ala Asp Ala Phe Glu Glu Ala 
3875 3880 3885 

Cys Asp Ala Leu Ser Ala His Leu Gly Ala Asp Val Arg Gly Val Leu 
3890 3895 3900 

Phe Gly Ala Asp Glu Gin Met Leu Asp Arg Thr Leu Trp Ala Gin Ser 
905 3910 3915 3920 

Gly He Phe Ala Val Gin Val Gly Leu Leu Gly Leu Leu Arg Ser Trp 
3925 3930 3935 

Gly Val Arg Pro Ala Ala Val Leu Gly His Ser Val Gly Glu Leu Ala 
3940 3945 3950 

Ala Ala His Ala Ala Gly Val Leu Ser Leu Pro Asp Ala Ala Arg Leu 
3955 3960 3965 

Val Ala Ala Arg Ala His Leu Met Gin Ala Leu Pro Thr Gly Gly Ala 
3970 3975 3980 

Met Leu Ala Val Ala Thr Ser Glu Ala Ala Val Gly Pro Leu Leu Ser 
985 3990 3995 4000 

Gly Val Cys Asp Arg Val Ser He Ala Ala lie Asn Gly Pro Glu Ser 
4005 4010 4015 

Val Val Leu Ser Gly Asp Arg Asp Val Leu Val Glu Leu Ala Gly Glu 
4020 4025 4030 

Phe Asp Ala Arg Gly Leu Arg Thr Lys Trp Leu Arg Val Ser His Ala 
4035 4040 4045 

Phe His Ser His Arg Met Glu Pro He Leu Asp Glu Tyr Ala Glu Thr 
4050 4055 4060 

Ala Arg Cys Val Glu Phe Gly Glu Pro Val Val Pro He Val Ser Ala 
065 4070 4075 4080 
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Ala Thr Gly Ala Leu Asp Thr Thr Gly Leu Met Cys Ala Ala Asp Tyr 
4085 4090 4095 

Trp Thr Arg Gin Val Arg Asp Pro Val Arg Phe Gly Asp Gly Val Arg 
4100 4105 4110 

Ala Leu Val Gly Gin Gly Val Asp Thr He Val Glu Phe Gly Pro Asp 
4115 4120 4125 

Gly Ala Leu Ser Ala Leu Val Glu Gin Cys Leu Ala Gly Ser Asp Gin 
4130 4135 4140 

Ala Gly Arg Val Ala Ala He Pro Leu Met Arg Arg Asp Arg Asp Glu 
145 4150 4155 4160 

Val Glu Thr Ala Val Ala Ala Leu Ala His Val His Val Arg Gly Gly 
4165 4170 4175 

Ala Val Asp Trp Ser Ala Cys Phe Ala Gly Thr Gly Ala Arg Thr Val 
4180 4185 4190 

Glu Leu Pro Thr Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Ala Gly 
4195 4200 4205 

Gin Ala Asp Gly Arg Gly Gly Asp Val Val Ala Asp Pro Val Asp Ala 
4210 4215 4220 

Arg Phe Trp Glu Leu Val Glu Arg Ala Asp Pro Glu Pro Leu Val Asp 
225 4230 4235 4240 

Glu Leu Cys He Asp Arg Asp Gin Pro Phe Arg Glu Val Leu Pro Val 
4245 4250 4255 

Leu Ala Ser Trp Arg Glu Lys Gin Arg Gin Glu Ala Leu Ala Asp Ser 
4260 4265 4270 

Trp Arg Tyr Gin Val Arg Trp Arg Ser Val Glu Val Pro Ser Ala Ala 
4275 4280 4285 

Ala Leu Arg Gly Val Trp Leu Val Val Leu Pro Ala Asp Val Pro Arg 
4290 4295 4300 

Asp Gin. Pro Ala Val Val He Asp Ala Leu He Ala Arg Gly Ala Glu 
305 4310 4315 4320 

Val Ala Val Leu Glu Leu Thr Glu Gin Asp Leu Gin Arg Ser Ala Leu 
4325 4330 4335 
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Val Asp Lys Val Arg Ala Val lie Ala Asp Arg Thr Glu Val Thr Gly 
4340 4345 4350 

Val Leu Ser Leu Leu Ala Met Asp Gly Met Pro Cys Ala Ala His Pro 
4355 4360 4365 

His Leu Ser Arg Gly Val Ala Ala Thr Val He Leu Thr Gin Val Leu 
4370 4375 4380 

Gly Asp Ala Gly Val Ser Ala Pro Leu Trp Leu Ala Thr Thr Gly Gly 
385 4390 4395 4400 

Val Glu Ala Gly Thr Glu Asp Gly Pro Ala Asp Pro Asp His Gly Leu 
4405 4410 4415 

He Trp Gly Leu Gly Arg Val Val Gly Leu Glu His Pro Gin Trp Trp 
4420 4425 4430 

Gly Gly Leu He Asp Leu Pro Glu Thr Leu Asp Glu Thr Ser Arg Asn 
4435 4440 4445 

Gly Leu Val Ala Ala Leu Ala Gly Thr Ala Ala Glu Asp Gin Leu Ala 
4450 4455 4460 

Val Arg Ser Ser Gly Leu Phe Val Arg Arg Val Val Arg Ala Ala Arg 
465 4470 4475 4480 

Asn Pro Arg Ser Glu Thr Trp Arg Ser Arg Gly Thr Val Leu He Thr 
4485 4490 4495 

Gly Gly Thr Gly Ala Leu Gly Ala Glu Val Ala Arg Trp Leu Ala Arg 
4500 4505 4510 

Arg Gly Ala Glu His Leu Val Leu He Ser Arg Arg Gly Pro Glu Ala 
4515 4520 4525 

Pro Gly Ala Ala Asp Leu Gly Ala Glu Leu Thr Glu Leu Gly Val Lys 
4530 4535 4540 

Val Thr Val Leu Ala Cys Asp Val Thr Asp Arg Asp Glu Leu Ala Ala 
545 4550 4555 4560 

Val Leu Ala Ala Val Pro Thr Glu Tyr Pro Leu Ser Ala Val Val His 
4565 4570 4575 

Thr Ala Gly Val Gly Thr Pro Ala Asn Leu Ala Glu Thr Thr Leu Ala 
4580 4585 4590 
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Gin Phe Ala Asp Val Leu Ser Ala Lys Val Val Gly Ala Ala Asn Leu 
4595 4600 4605 



Asp Arg Leu Leu Gly Gly Gin Pro 
4610 4615 

Ser He Ser Gly Val Trp Gly Ala 
625 4630 

Ala Asn Ala Tyr Leu Asp Ala Leu 
4645 



Leu Asp Ala Phe Val Leu Phe Ser 
4620 

Gly Gly Gin Gly Ala Tyr Ser Ala 
4635 4640 

Ala Glu Arg Arg Arg Ala Cys Gly 
4650 4655 



Arg Pro Ala Thr Cys He Ala Trp Gly Pro Trp Ala Gly Ala Gly Met 
4660 4665 4670 

Ala Val Gin Glu Gly Asn Glu Ala His Leu Arg Arg Arg Gly Leu Val 
4675 4680 4685 

Pro Met Glu Pro Gin Ser Ala Leu Phe Ala Leu Gin Gin Ala Leu Ser 
4690 4695 4700 

Gin Arg Glu Thr Ala He Thr Val Ala Asp Val Asp Trp Glu Arg Phe 
705 4710 4715 4720 

Ala Ala Ser Phe Thr Ala Ala Arg Pro Arg Pro Leu Leu Glu Glu He 
4725 4730 4735 

Val Asp Leu Arg Pro Asp Thr Glu Thr Glu Glu Lys His Gly Ala Gly 
4740 4745 4750 

Glu Leu Gly Gin Gin Leu Ala Ala Leu Pro Pro Ala Glu Arg Gly His 
4755 4760 4765 

Leu Leu Leu Glu Val Val Leu Ala Glu Thr Ala Ser Thr Leu Gly His 
4770 4775 4780 

Asp Ser Ala Glu Ala Val Gin Pro Asp Arg Thr Phe Ala Glu Leu Gly 
785 4790 4795 4800 

Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Asn Ala Val 
4805 4810 4815 

Thr Gly Leu Arg Leu Pro Pro Thr Leu Val Phe Asp His Pro Thr Pro 
4820 4825 4830 

Leu Ala Leu Ser Glu Gin Leu Val Pro Ala Leu Val Ala Glu Pro Asp 
4835 4840 4845 



270 



WO 01/16303 



PCT/EPOO/08013 



Asn Gly lie Glu Ser Leu Leu Ala Glu Leu Asp Arg Leu Asp Thr Thr 
4850 4855 4860 

Leu Ala Gin Gly Pro Ser lie Pro Leu Glu Asp Gin Ala Lys Val Ala 
865 4870 4875 4880 

Glu Arg Leu His Ala Leu Leu Ala Lys Trp Asp Gly Ala Arg Asp Gly 
4885 4890 4895 

Thr Ala Arg Ala Thr Ser Pro Gin Ser Leu Thr Ala Ala Thr Asp Asp 
4900 4905 4910 

Glu lie Phe Asp Leu lie Asp Arg Lys Phe Arg Arg 
4915 4920 



<210> 49 
<211> 16767 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (16764) 

<223> ORF22; Polyketidsynthase 

<400> 49 

atg gcc aat gaa gaa aag etc cgc gag tac etc aag cgt gtc gtc gtc 48 

Met Ala Asn Glu Glu Lys Leu Arg Glu Tyr Leu Lys Arg Val Val Val 

15 10 15 

gaa ctg gaa gag gcg cac gaa cgc ctg cac gag ttg gag cgc cag gag 96 
Glu Leu Glu Glu Ala His Glu Arg Leu His Glu Leu Glu Arg Gin Glu 
20 25 30 

cac gac ccc ate gcg ate gtg teg atg gga tgt cgt tat ccc ggt ggc 144 
His Asp Pro lie Ala lie Val Ser Met Gly Cys Arg Tyr Pro Gly Gly 
35 40 45 

gtc tec act ccg gag gag ctg tgg cga ctg gtc gtc gac gga gga gac 192 
Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Val Val Asp Gly Gly Asp 
50 55 60 

gcg ate gcg aac ttc ccc gaa gac cgt ggc tgg aat ctg gac gag ctg 24 0 
Ala lie Ala Asn Phe Pro Glu Asp Arg Gly Trp Asn Leu Asp Glu Leu 
65 70 75 80 
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ttc gat cct gat ccg ggc cga gcc ggg acc tec tac gtc cgc gag ggt 288 
Phe Asp Pro Asp Pro Gly Arg Ala Gly Thr Ser Tyr Val Arg Glu Gly 
85 90 95 

ggt ttc ctg cgc ggg gtc gcg gac ttc gat gcc ggg etc ttc ggg ate 336 
Gly Phe Leu Arg Gly Val Ala Asp Phe Asp Ala Gly Leu Phe Gly lie 
100 105 110 

agt ccg cgc gag gca cag gcg atg gac ccg caa cag egg ttg ctg ctg 384 
Ser Pro Arg Glu Ala Gin Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

gag ate teg tgg gag gtg ttc gag cgc gcc ggc att gac ccg ttt tct 432 
Glu lie Ser Trp Glu Val Phe Glu Arg Ala Gly lie Asp Pro Phe Ser 
130 135 140 

ttg egg ggt acc aag acc ggt gtg ttc gcg ggc ctg ate tac cac gac 480 
Leu Arg Gly Thr Lys Thr Gly Val Phe Ala Gly Leu lie Tyr His Asp 
145 150 155 160 

tac gcg teg egg ttt cgc aag acc ccc gcg gag ttc gag ggt tac ttc 528 
Tyr Ala Ser Arg Phe Arg Lys Thr Pro Ala Glu Phe Glu Gly Tyr Phe 
165 170 175 

gcc acc ggc aac gcg ggc age gtc gca tec ggc egg gtg get tac acc 576 
Ala Thr Gly Asn Ala Gly Ser Val Ala Ser Gly Arg Val Ala Tyr Thr 
180 185 190 

ttc ggg tta gag ggc ccg gcg gtc acc gtg gac acc gcc tgc teg teg 624 
Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser 
195 200 205 

tec ctg gtg gcg ctg cac ctg gcc tgc cag tec ctg egg ctg ggc gaa 672 
Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Leu Gly Glu 
210 215 220 

tgc gac ctg gcc ctg gcc ggt ggc att teg gtg atg gcc acg ccg gga 720 
Cys Asp Leu Ala Leu Ala Gly Gly lie Ser Val Met Ala Thr Pro Gly 
225 230 235 240 

gcc ttc gtc gag ttc age egg caa cgc gca etc gcc teg gat ggc egg 768 
Ala Phe Val Glu Phe Ser Arg Gin Arg Ala Leu Ala Ser Asp Gly Arg 
245 250 255 

tgc aag ccc ttc gcg gat gcc gcc gac ggc acc ggc tgg ggc gag ggc 816 
Cys Lys Pro Phe Ala Asp Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 
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gcc gga atg ctg ctg ctg gaa egg ctg teg gac gca cga cga aac ggc 864 

Ala Gly Met Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 

275 260 285 



cac ccg gtg ctg gcg* gcg gtg gtc ggt tec gcg ate aac cag gac ggg 912 
His Pro Val Leu Ala Ala Val Val Gly Ser Ala He Asn Gin Asp Gly 
290 295 300 



acg tec aac ggc ctg ace gcg ccc age 
Thr Ser Asn Gly Leu Thr Ala Pro Ser 
305 310 

ate cgc caa gcc ctg gcg aac gcc ggg 
He Arg Gin Ala Leu Ala Asn Ala Gly 
325 

gtg gtc gag gcg cac ggc acg ggc acg 
Val Val Glu Ala His Gly Thr Gly Thr 
340 345 



ggt ccc gca cag cag cga gtg 960 
Gly Pro Ala Gin Gin Arg Val 
315 320 

ttg teg ccc gcc gag gtc gat 1008 
Leu Ser Pro Ala Glu Val Asp 
330 335 

gcc ttg ggc gac ccg ate gag 1056 
Ala Leu Gly Asp Pro He Glu 
350 



gcg cag gcc ctg ate gcc ace tac ggg gcg aac egg teg gcg gat cat 1104 
Ala Gin Ala Leu He Ala Thr Tyr Gly Ala Asn Arg Ser Ala Asp His 
355 360 365 



ccg ctg ctg ctg ggt tec etc aag teg aac ate ggc cac ace cag get 1152 
Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly His Thr Gin Ala 
370 375 380 



gcc gcc ggt gtg gcc ggg gtg ate aag 
Ala Ala Gly Val Ala Gly Val He Lys 
385 390 

egg gag atg ccc cgc age ctg cac ate 
Arg Glu Met Pro Arg Ser Leu His He 
405 



teg gtc ctg gcc ate agg cac 1200 
Ser Val Leu Ala He Arg His 
395 400 

gac cag cca teg cag cac gtg 1248 
Asp Gin Pro Ser Gin His Val 
410 415 



gac tgg teg gcg ggc gcg gtg egg ctg etc acg gac age gtt gac tgg 1296 
Asp Trp Ser Ala Gly Ala Val Arg Leu Leu Thr Asp Ser Val Asp Trp 
420 425 430 

ccg gat etc ggc agg ccg cgc cga gca ggg gtg tec teg ttc ggc atg 1344 
Pro Asp Leu Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Met 
435 440 445 



age ggt ace aac gca cac ctg ate gtc gag gaa gta tec gac gag ccg 1392 
Ser Gly Thr Asn Ala His Leu He Val Glu Glu Val Ser Asp Glu Pro 
450 455 460 
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gtc teg ggc agt acc gag ccg acc ggg gca ttt ccc tgg ccg ctg tec 1440 
Val Ser Gly Ser Thr Glu Pro Thr Gly Ala Phe Pro Trp Pro Leu Ser 
465 470 475 480 

ggc aag acg gag acg gca ttg cgc gag cag get gee gag ttg etc tec 1488 
Gly Lys Thr Glu Thr Ala Leu Arg Glu Gin Ala Ala Glu Leu Leu Ser 
485 490 495 

gta gtg acc gag cac ccg gag ccg gga ctg ggg gac gtc ggg tac teg 1536 
Val Val Thr Glu His Pro Glu Pro Gly Leu Gly Asp Val Gly Tyr Ser 
500 505 510 

ctg gee acc ggt cgc get gcg atg gag cac egg get gtc gtg gtt gee 1584 
Leu Ala Thr Gly Arg Ala Ala Met Glu His Arg Ala Val Val Val Ala 
515 520 525 

gac gat egg gac tct ttc gtc gee gga ctg acg gcg ttg get gcg ggc 1632 
Asp Asp Arg Asp Ser Phe Val Ala Gly Leu Thr Ala Leu Ala Ala Gly 
530 535 540 

gtt ccg gca gee aac gtg gtg cag ggc gcg gee gac tgc aag gga aag 1680 
Val Pro Ala Ala Asn Val Val Gin Gly Ala Ala Asp Cys Lys Gly Lys 
545 550 555 560 

gtc gcg ttc gtg ttc ccc ggc cag ggc teg cat tgg cag ggg atg gcg 1728 
Val Ala Phe Val Phe Pro Gly Gin Gly Ser His Trp Gin Gly Met Ala 
565 570 575 

agg gaa ctg tec gaa tec teg ccg gtg ttc egg egg aag ctg gcg gaa 1776 
Arg Glu Leu Ser Glu Ser Ser Pro Val Phe Arg Arg Lys Leu Ala Glu 
580 585 590 

tgc gcg gcg get acg gee cct tac gtg gac tgg teg ctg etc ggc gtc 1824 
Cys Ala Ala Ala Thr Ala Pro Tyr Val Asp Trp Ser Leu Leu Gly Val 
595 600 605 

ctt cgc ggt gat ccc gat gca ccc gcg ctg gat cgc gac gac gtg att 1872 
Leu Arg Gly Asp Pro Asp Ala Pro Ala Leu Asp Arg Asp Asp Val lie 
610 615 620 

cag etc gcg ctg ttc gec atg atg gtg teg ctg gee gaa ctg tgg cgt 1920 
Gin Leu Ala Leu Phe Ala Met Met Val Ser Leu Ala Glu Leu Trp Arg 
625 630 635 640 

teg tgc gga gtg gag ccc gee gcg gtg gtc ggt cat tec cag ggc gag 196 8 
Ser Cys Gly Val Glu Pro Ala Ala Val Val Gly His Ser Gin Gly Glu 
645 650 655 
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ate gec gec gec cat gtg gca ggc get ttg tec ttg act gat gcg gtg 2016 
lie Ala Ala Ala His Val Ala Gly Ala Leu Ser Leu Thr Asp Ala Val 
660 665 670 

cgc ate ate get gee cgc tgc gat gcg gtg teg gcg ctg ace ggg aag 2064 
Arg lie lie Ala Ala Arg Cys Asp Ala Val Ser Ala Leu Thr Gly Lys 
675 680 685 

gga ggc atg etc gcg att gee ttg ccg gaa age gcg gtg gtg aag cga 2112 
Gly Gly Met Leu Ala He Ala Leu Pro Glu Ser Ala Val Val Lys Arg 
690 695 700 

ate gca ggc ctg ccg gag ctg ace gtt gcg gcg gtc aac gga ccc ggc 2160 
He Ala Gly Leu Pro Glu Leu Thr Val Ala Ala Val Asn Gly Pro Gly 
705 710 715 720 

tec act gtc gtt tec ggc gaa ccg teg get ctg gag cgt ctg cag ace 2208 
Ser Thr Val Val Ser Gly Glu Pro Ser Ala Leu Glu Arg Leu Gin Thr 
725 730 735 

gaa ctg acc gcg gaa aac gtg cag ace egg egg gtg gga att gat tac 2256 
Glu Leu Thr Ala Glu Asn Val Gin Thr Arg Arg Val Gly He Asp Tyr 
740 745 750 

gec teg cat teg ccg cag ate gcg cag gtc cag ggc egg ctt ctg gac 2304 
Ala Ser His Ser Pro Gin He Ala Gin Val Gin Gly Arg Leu Leu Asp 
755 760 765 

egg ctg ggc gaa gtc ggg tec gaa cct get gag ate get ttc tac teg 2352 
Arg Leu Gly Glu Val Gly Ser Glu Pro Ala Glu lie Ala Phe Tyr Ser 
770 775 780 

acg gtc acc ggc gag egg acg gac acc ggc cga etc gac gee gac tac 2400 
Thr Val Thr Gly Glu Arg Thr Asp Thr Gly Arg Leu Asp Ala Asp Tyr 
785 790 795 800 

tgg tac cag aac ctt egg cag ccc gtc cgc ttc cag cag acc gtc gec 2448 
Trp Tyr Gin Asn Leu Arg Gin Pro Val Arg Phe Gin Gin Thr Val Ala 
805 810 815 

egg atg gca gat cag ggc tat egg ttc ttc gtc gag gtg age ccg cac 24 96 
Arg Met Ala Asp Gin Gly Tyr Arg Phe Phe Val Glu Val Ser Pro His 
820 825 830 

ccg ctg etc acc gee gga ate cag gaa acg ctg gaa gee gcg gac gcg 2544 
Pro Leu Leu Thr Ala Gly He Gin Glu Thr Leu Glu Ala Ala Asp Ala 
835 840 845 
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ggc ggg gtg gtg gtc ggt teg ctg egg cgt ggc gag ggc ggc tec egg 
Gly Gly Val Val Val Gly Ser Leu Arg Arg Gly Glu Gly Gly Ser Arg 
850 855 860 



2592 



cgc tgg ctg act teg ctg gee gag tgc cag gtg cgc gga ctg ccg gtg 
Arg Trp Leu Thr Ser Leu Ala Glu Cys Gin Val Arg Gly Leu Pro Val 
865 870 875 880 



.2640 



aat tgg gaa cag gta ttc etc aac acc gga gee cga cgc gtg ccg ctg 
Asn Trp Glu Gin Val Phe Leu Asn Thr Gly Ala Arg Arg Val Pro Leu 
885 890 695 



2688 



ccg acc tac ccg ttc cag egg cag egg tac tgg ttg gag tec gee gag 
Pro Thr Tyr Pro Phe Gin Arg Gin Arg Tyr Trp Leu Glu Ser Ala Glu 
900 905 910 



2736 



tac gac gcg ggc gat etc ggt teg gtg ggc ttg etc tec gee gag cat 
Tyr Asp Ala Gly Asp Leu Gly Ser Val Gly Leu Leu Ser Ala Glu His 
915 920 925 



2784 



ccc ctg etc ggg get gcg gtg acg ctg gee gat gcg ggc ggg ttc ctg 
Pro Leu Leu Gly Ala Ala Val Thr Leu Ala Asp Ala Gly Gly Phe Leu 
930 935 940 



2832 



ctg acc ggc aag ctg teg gtc aag acc cag ccc tgg ttg gee gac cac 
Leu Thr Gly Lys Leu Ser Val Lys Thr Gin Pro Trp Leu Ala Asp His 
945 950 955 960 



2880 



gtg gtc ggc ggg gcg ate ctg ctg ccc ggc acc gcg ttc gtg gaa atg 
Val Val Gly Gly Ala lie Leu Leu Pro Gly Thr Ala Phe Val Glu Met 
965 970 975 



2928 



ctg ata cgc gee gcg gac cag gtc ggg tgc gat ctg ate gag gag ttg 
Leu lie Arg Ala Ala Asp Gin Val Gly Cys Asp Leu lie Glu Glu Leu 
980 985 990 



2976 



tec ctg acg act ccg ctg gtt ttg ccc gcg acc ggt gcg gtg cag gtg 
Ser Leu Thr Thr Pro Leu Val Leu Pro Ala Thr Gly Ala Val Gin Val 
995 1000 1005 



3024 



cag ate gcg gtt ggc ggt ccg gac gag gee ggg cgc cgc teg gtc cgc 
Gin lie Ala Val Gly Gly Pro Asp Glu Ala Gly Arg Arg Ser Val Arg 
1010 1015 1020 



3072 



gtg cat tec tgt cga gac gac gee gtg ccg cag gac teg tgg acc tgc 
Val His Ser Cys Arg Asp Asp Ala Val Pro Gin Asp Ser Trp Thr Cys 
1025 1030 1035 1040 



3120 
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cac gcg acc ggc acg ttg acc tec age gat cac cag gac gee ggc cag 
His Ala Thr Gly Thr Leu Thr Ser Ser Asp His Gin Asp Ala Gly Gin 
1045 1050 1055 



3168 



ggc ccc gat ggg att tgg ccg ccc aac gat get gtc gcg gtt ccg ctg 
Gly Pro Asp Gly lie Trp Pro Pro Asn Asp Ala Val Ala Val Pro Leu 
1060 1065 1070 



3216 



gac age ttc tac gee cgc gca get gag egg ggc ttc gat ttc ggc ccg 
Asp Ser Phe Tyr Ala Arg Ala Ala Glu Arg Gly Phe Asp Phe Gly Pro 
1075 1080 1085 



3264 



gcg ttc cag ggg ttg cag gcg get tgg aag cgc gga gac gag ate ttc 
Ala Phe Gin Gly Leu Gin Ala Ala Trp Lys Arg Gly Asp Glu lie Phe 
1090 1095 1100 



3312 



gee gag gtc ggc ctg ccc acc gca cac cgc gaa gac gee ggc agg ttc 
Ala Glu Val Gly Leu Pro Thr Ala His Arg Glu Asp Ala Gly Arg Phe 
1105 1110 1115 1120 



3360 



gga ate cac cct get ctg ctg gat gcg gca- ctg cag gcg ctg ggc gca 
Gly lie His Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Leu Gly Ala 
1125 1130 1135 



3408 



gee gaa gag gat ccg gac gag gga tgg etc ccg ttc gcg tgg caa ggt 
Ala Glu Glu Asp Pro. Asp Glu Gly Trp Leu Pro Phe Ala Trp Gin Gly 
1140 1145 1150 



3456 



gtg tec etc aaa gcg acg ggc gca ctt tec ctt egg gtg cac etc gtt 
Val Ser Leu Lys Ala Thr Gly Ala Leu Ser Leu Arg Val His Leu Val 
1155 1160 1165 



3504 



ccg gcg ggc gcg aat gcg gtg teg gtg ttc acg acc gac acg act ggc 
Pro Ala Gly Ala Asn Ala Val Ser Val Phe Thr Thr Asp Thr Thr Gly 
1170 1175 1180 



3552 



caa gee gtg etc tec ate gat teg ctg gtg ctg cgc cag att teg gac 
Gin Ala Val Leu Ser lie Asp Ser Leu Val Leu Arg Gin lie Ser Asp 
1185 1190 1195 1200 



3600 



aag cag ttg gca gcg gee cgt gcg atg gaa cac gag tec ctg ttc egg 
Lys Gin Leu Ala Ala Ala Arg Ala Met Glu His Glu Ser Leu Phe Arg 
1205 1210 1215 



3648 



gtc gac tgg aag cga ate teg ccc ggc get gee aag ccg gtc tec tgg 
Val Asp Trp Lys Arg lie Ser Pro Gly Ala Ala Lys Pro Val Ser Trp 
1220 1225 1230 



3696 
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gca gtg ate ggc aat gac gaa etc gee cga gee tgc ggc teg gca ctt 3744 
Ala Val lie Gly Asn Asp Glu Leu Ala Arg Ala Cys Gly Ser Ala Leu 
1235 1240 1245 

ggc acg gaa etc cac ccc gac ctg acc ggg ttg get gac ccg ccc ccg 3792 
Gly Thr Glu Leu His Pro Asp Leu Thr Gly Leu Ala Asp Pro Pro Pro 
1250 1255 1260 

gac gtc gtg gtg gtg cca tgc ggt gcg tct cgc cag gac ttg gac gtt 3840 
Asp Val Val Val Val Pro Cys Gly Ala Ser Arg Gin Asp Leu Asp Val 
1265 1270 1275 1280 

get tec gag gca cgt gee gcg aca caa cgc atg ctt gac ctg ate cag 3 88 8 
Ala Ser Glu Ala Arg Ala Ala Thr Gin Arg Met Leu Asp Leu He Gin 
1285 1290 1295 

gat tgg ttg gcg gcg gcg cga ttc gee gga tct cgc ctg gtg gtt gtg 3936 
Asp Trp Leu Ala Ala Ala Arg Phe Ala Gly Ser Arg Leu Val Val Val 
1300 1305 1310 

acg tgt ggt gcg gcg teg aca ggt ccc gec gag ggt gtt tec gac ctg 3 984 
Thr Cys Gly Ala Ala Ser Thr Gly Pro Ala Glu Gly Val Ser Asp Leu 
1315 1320 1325 

gtg cat get gcg teg tgg ggt ttg ttg cgt teg gcg cag teg gag aac 4 032 
Val His Ala Ala Ser Trp Gly Leu Leu Arg Ser Ala Gin Ser Glu Asn 
1330 1335 1340 

ccg gac cga ttc gtg ttg gtc gat gtg gac gga acc gee gaa tea tgg 4080 
Pro Asp Arg Phe Val Leu Val Asp Val Asp Gly Thr Ala Glu Ser Trp 
1345 1350 1355 1360 

cgt gcg etc gcg gcg gee gtg cgt tec gga gaa ccg cag ctg gcg ttg 4128 
Arg Ala Leu Ala Ala Ala Val Arg Ser Gly Glu Pro Gin Leu Ala Leu 
1365 1370 1375 

cgc gee ggt gaa gtc egg gtg cct cgc ctg gcg cga tgt gtt gee gee 4176 
Arg Ala Gly Glu Val Arg Val Pro Arg Leu Ala Arg Cys Val Ala Ala 
1380 1385 1390 

gag gac age egg ate cca gtg ccc ggt gcg gat ggg acg gtg ttg att 4224 
Glu Asp Ser Arg He Pro Val Pro Gly Ala Asp Gly Thr Val Leu He 
1395 1400 1405 

tec ggc ggt acg ggc ctg ctg ggc ggg ttg gtt gee egg cat ttg gtg 4272 
Ser Gly Gly Thr Gly Leu Leu Gly Gly Leu Val Ala Arg His Leu Val 
1410 1415 1420 
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gcg gag cgc ggt gtc cgc cgc ctg gtg etc gcg ggg cga cgc ggc tgg 
Ala Glu Arg Gly Val Arg Arg Leu Val Leu Ala Gly Arg Arg Gly Trp 
1425 1430 1435 1440 



4320 



age gee ccc ggg^ gtc acc gac ctg gtg gat gag ttg gtg ggc ctg gga 
Ser Ala Pro Gly Val Thr Asp Leu Val Asp Glu Leu Val Gly Leu Gly 
1445 1450 1455 



4368 



get gcg gtc gag gtg gcg age tgc gat gtc ggg gat egg gee cag ttg 
Ala Ala Val Glu Val Ala Ser Cys Asp Val Gly Asp Arg Ala Gin Leu 
1460 1465 1470 



4416 



gac egg ctg ctg acg acg ate teg gca gag ttc ccg ctg cgc gga gtg 
Asp Arg Leu Leu Thr Thr lie Ser Ala Glu Phe Pro Leu Arg Gly Val 
1475 1480 1485 



4464 



gtg cat gcg gee ggg gca ctt gee gac ggg gtc gtc gag teg ctg aca 
Val His Ala Ala Gly Ala Leu Ala Asp Gly Val Val Glu Ser Leu Thr 
1490 1495 1500 



4512 



cca gag cac gtg gca aag gtg ttc ggc ccg aag gee gee ggt gcg tgg 
Pro Glu His Val Ala Lys Val Phe Gly Pro Lys Ala Ala Gly Ala Trp 
1505 1510 1515 1520 



4560 



cac ctg cac gag ttg act ctt gat ctg gat etc teg ttc ttc gtg etc 
His Leu His Glu Leu Thr Leu Asp Leu Asp Leu Ser Phe Phe .Val Leu 
1525 1530 1535 



4608 



ttc tec teg ttc tec ggc gtg gcg ggg get gcg ggt cag gga aac tac 
Phe Ser Ser Phe Ser Gly Val Ala Gly Ala Ala Gly Gin Gly Asn Tyr 
1540 1545 1550 



4656 



gcg gcg gcg aac gcg ttc ctg gac ggc ctg get cag cac egg egg acg 
Ala Ala Ala Asn Ala Phe Leu Asp Gly Leu Ala Gin His Arg Arg Thr 
1555 - 1560 1565 



4704 



gcg ggg ctg cct gcg gtg teg ctg get tgg ggc ttg tgg gag cag ccc 
Ala Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Glu Gin Pro 
1570 1575 1580 



4752 



age ggg atg acc gga gcg etc gat gcg gcg ggc cgt age cgc att gcg 
Ser Gly Met Thr Gly Ala Leu Asp Ala Ala Gly Arg Ser Arg lie Ala 
1585 1590 1595 1600 



4800 



cgc acc aat ccg ccg atg tec gcg ccg gac ggg ttg egg ctg ttc gag 
Arg Thr Asn Pro Pro Met Ser Ala Pro Asp Gly Leu Arg Leu Phe Glu 
1605 1610 1615 



4848 
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atg gcg ttt cgc gtt ccg ggc gaa teg ctt ctg gtt ccg gtc cac gtc 
Met Ala Phe Arg Val Pro Gly Glu Ser Leu Leu Val Pro Val His Val 
1620 1625 1630 



4896 



gac ctg aac gec ctg cgc get gat -gcg gec gac ggc ggt gtg cct gcg 
Asp Leu Asn Ala Leu Arg Ala Asp Ala Ala Asp Gly Gly Val Pro Ala 
1635 1640 1645 



4944 



ttg ttg cgc gac ctg gtg cca gcg ccc gtg egg egg age gcg gtc aac 
Leu Leu Arg Asp Leu Val Pro Ala Pro Val Arg Arg Ser Ala Val Asn 
1650 1655 1660 



4992 



gag teg gcg gac gtc aac ggt ctg gtt ggt egg ctg egg agg ctg ccg 
Glu Ser Ala Asp Val Asn Gly Leu Val Gly Arg Leu Arg Arg Leu Pro 
1665 1670 1675 1680 



5040 



gac ctg gat cag gaa acc cag ctg ttg ggt ttg gtg cgc gag cat gtt 
Asp Leu Asp Gin Glu Thr Gin Leu Leu Gly Leu Val Arg Glu His Val 
1685 1690 1695 



5088 



teg gcg gtg ctg ggg cat teg ggt gcg gtc gag gtc ggg gee gat cgt 
Ser Ala Val Leu Gly His Ser Gly Ala Val Glu Val Gly Ala Asp Arg 
1700 1705 1710 



5136 



get ttc egg gat ttg ggt ttt gat teg ttg tec ggt gtg gag ttt egg 
Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly Val Glu Phe Arg 
1715 1720 1725 



5184 



aac egg ctt ggc ggg gtg ctg ggc gtt egg ttg ccg get act gcg gtg 
Asn Arg Leu Gly Gly Val Leu Gly Val Arg Leu Pro Ala Thr Ala Val 
1730 1735 1740 



5232 



ttc gac tat ccg aca ccg egg gcg ttg gtt egg ttc ttg etc gac aaa 
Phe Asp Tyr Pro Thr Pro Arg Ala Leu Val Arg Phe Leu Leu Asp Lys 
1745 1750 1755 1760 



5280 



ctg att ggt ggc gtg gag get ccg act ccc gca ccg gcg get gtg gcg 
Leu lie Gly Gly Val Glu Ala Pro Thr Pro Ala Pro Ala Ala Val Ala 
1765 1770 177-5 



5328 



gcg gtg act get gac gat ccc gtt gtg ate gtg ggg atg ggc tgt cgt 
Ala Val Thr Ala Asp Asp Pro Val Val lie Val Gly Met Gly Cys Arg 
1780 1785 1790 



5376 



tat ccg ggt ggg gtg tec teg ccg gag gag ctt tgg cgt ttg gtg gee 
Tyr Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala 
1795 1800 1805 



5424 
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ggg ggc ttg gat gcg gtg gcg gag ttc ccg gac gat cgt ggc tgg gat 54 72 

Gly Gly Leu Asp Ala Val Ala Glu Phe Pro Asp Asp Arg Gly Trp Asp 
1810 1815 1820 



cag gcg ggg ttg ttc gat ccg gat ccc gat cgt ctt ggg*acc teg tat 5520 
Gin Ala Gly Leu Phe Asp Pro Asp Pro Asp Arg Leu Gly Thr Ser Tyr 
1825 1830 1835 1840 

gtg tgt gag ggt ggc ttc ctg cga gat gcg gca gag ttc gat gec ggt 5568 

Val Cys Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala Gly 
1845 1850 1855 

ttc ttc ggg att tec ccg cgt gag gcg ttg gcg atg gat ccg cag cag 5616 

Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin 
i860 1865 1870 



egg ttg ctg ctg gaa gtc get tgg gaa ace gtg gag egg gcg ggg att 5664 
Arg Leu Leu Leu Glu Val Ala Trp Glu Thr Val Glu Arg Ala Gly lie 
1875 1880 1885 

gat ccg ctt teg ttg egg ggg age egg ace ggc gtg ttc gcg ggg ctg 5712 
Asp Pro Leu Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu 
1890 1895 1900 

atg cac cac gac tac ggc gcg egg ttc ate acg agg gcg ccg gag ggt 5760 
Met His His Asp Tyr Gly Ala Arg Phe He Thr Arg Ala Pro Glu Gly 
1905 1910 1915 1920 



ttc gag ggt tat eta ggt aat ggc age gcg gga ggc gtg ttt teg ggt 5808 

Phe Glu Gly Tyr Leu Gly Asn Gly Ser Ala Gly Gly Val Phe Ser Gly 
1925 1930 1935 

egg gtt gcg tat teg ttt ggt ttc gag ggt cct gcg gtg acg gtg gat 5856 

Arg Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp 
1940 . 1945 1950 

acg gcg tgt teg teg teg ttg gtg gcg ctg cac ctg gcg ggt caa gca 5904 

Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Gly Gin Ala 
1955 1960 1965 



ctg egg tct ggt gag tgt gat ctg get ctt gcg ggt ggt gtg acg gtg 5952 
Leu Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val 
1970 1975 1980 



atg gec acg ccg ggg atg ttc gtg gag ttt teg cgt caa egg ggc ttg 6000 
Met Ala Thr Pro Gly Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu 
1985 1990 1995 2000 
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gcg gcg gat ggg egg tgc aag teg ttt gcg gcg get gcg gat ggc acc 
Ala Ala Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr 
2.005 2010 2015 



6048 



ggt tgg gga gaa ggc gcg' ggc ttg gtg ttg ttg gag egg ctg teg gat 
Gly Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp 
2020 2025 2030 



6096 



gec egg cgc aac ggg cac gcg gtt ctg gcg gtc gtg egg ggt age gcg 
Ala Arg Arg Asn Gly His Ala Val Leu Ala Val Val Arg Gly Ser Ala 
2035 2040 2045 



6144 



gtg aat cag gat ggt gcg teg aat ggt ttg acg gcg ccg aat ggg ccc 
val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro 
2050 2055 2060 



6192 



teg cag cag egg gtg ate acg cag gcg ttg gcg agt get ggt ttg teg 
Ser Gin Gin Arg Val lie Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser 
2065 2070 2075 2080 



6240 



gtg tct gat gtg gac gee gtg gag gcg cat ggg act gga acc agg ctt 
Val Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu 
2085 2090 2095 



6288 



ggt gat ccg att gag gcg cag get ctg att gec act tac ggg cag ggg 6336 
Gly Asp Pro He Glu Ala Gin Ala Leu He Ala Thr Tyr Gly Gin Gly 
2100 2105 2110 

egg gat age gat egg ccg ttg tgg ttg ggg teg gtg aag teg aat att 6384 
Arg Asp. Ser Asp Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn He 
2115 2120 2125 

ggt cat acg cag gcg gcg gcg ggt gtc get ggt gtg ate aag atg gtg 6432 
Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val 
2130 2135 2140 

atg gcg atg egg cac ggg cag ctg ccc gcg acg ttg cat gtg gat gaa 6480 
Met Ala Met Arg His Gly Gin Leu Pro Ala Thr Leu His Val Asp Glu 
2145 2150 2155 » 2160 

cct acg teg gaa gtg gat tgg teg gcg ggg gat gtc cag etc etc acg 6528 
Pro Thr Ser Glu Val Asp Trp Ser Ala Gly Asp Val Gin Leu Leu Thr 
2165 2170 2175 

gag aac acc ccc tgg ccc ggc aac age cat cct egg egg gtg ggc gtg 6576 
Glu Asn Thr Pro Trp Pro Gly Asn Ser His Pro Arg Arg Val Gly Val 
2180 2185 2190 
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teg teg ttc ggg ate age ggc ace aac gca cac gtc ate etc gaa caa 6624 
Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Val He Leu Glu Gin 
2195 2200 2205 



gee teg aaa aca cca gac gag act gcg gac aag age ggt cee gat teg 6672 
Ala Ser Lys Thr Pro Asp Glu Thr Ala Asp Lys Ser Gly Pro Asp Ser 
2210 2215 2220 

gaa teg ace gtg gac ctt cca gcg gtc ccg ttg ate gtg teg ggg aga 6720 
Glu Ser Thr Val Asp Leu Pro Ala Val Pro Leu He Val Ser Gly Arg 
2225 2230 2235 2240 



aca ccg gca gcg etc age get cag gcg age gca ttg ttg tec tat ttg 6768 
Thr Pro Ala Ala Leu Ser Ala Gin Ala Ser Ala Leu Leu Ser Tyr Leu 
2245 2250 2255 

ggt gag cgt ggc gat att tec acg ctg gat gcg gcg ttt teg ttg get 6816 
Gly Glu Arg Gly Asp He Ser Thr Leu Asp Ala Ala Phe Ser Leu Ala 
2260 2265 2270 



tec tec egg gec gcg ttg gag gag egg gcg gtg gtg ctg gga gcg gac 6864 
Ser Ser Arg Ala Ala Leu Glu Glu Arg Ala Val Val Leu Gly Ala Asp 
2275 2280 2285 

cgc gaa acg ttg ttg tec ggg ttg gaa gcg ctg get tec ggt cgc gag 6912 
Arg Glu Thr Leu Leu Ser Gly Leu Glu Ala Leu Ala Ser Gly Arg Glu 
2290 2295 2300 

get tct ggg gtg gtg teg gga tec ccg gtc tct ggc ggg gtt ggg ttc 6960 
Ala Ser Gly Val Val Ser Gly Ser Pro Val Ser Gly Gly Val Gly Phe 
2305 2310 2315 2320 



gtg ttc gec ggt cag ggc gga cag tgg ttg ggg atg ggc egg ggg etc 7008 
Val Phe Ala Gly Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Gly Leu 
2325 2330 2335 



tac teg gtt ttt ccg gtg ttc get gac gcg ttt gac gaa gca tgt gee 7056 
Tyr Ser Val Phe Pro Val Phe Ala Asp Ala Phe Asp Glu Ala Cys Ala 
2340 2345 2350 

gga ctg gac gcg cat ctg ggg cag gac gtg ggg gtc egg gat gtg gtg 7104 
Gly Leu Asp Ala His Leu Gly Gin Asp Val Gly Val Arg Asp Val Val 
2355 2360 2365 



ttt ggt tec gac ggg tec ttg ttg gat egg acg ctg tgg gee cag teg 7152 
Phe Gly Ser Asp Gly Ser Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser 
2370 2375 2380 
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ggt ttg ttc gcg ttg cag gtt ggt ttg ctg age ctg ctg ggt teg tgg 
Gly Leu Phe Ala Leu Gin Val Gly Leu Leu Ser Leu Leu Gly Ser Trp 
2385 2390 2395 2400 



7200 



ggt gtc egg ccg ggt gtg gtg ctg ggc cat teg gtc ggc gag ttc gcg 
Gly Val Arg Pro Gly Val Val Leu Gly His Ser Val Gly Glu Phe Ala 
2405 2410 2415 



7248 



gcg gcg gtt gcg gcg gga gtg ttg teg ttg ccg gat gcg get egg atg 
Ala Ala Val Ala Ala Gly Val Leu Ser Leu Pro Asp Ala Ala Arg Met 
2420 2425 2430 



7296 



gtg gcg ggt cgt gee egg ttg atg cag gcg ttg cct tct ggc ggt gec 
Val Ala Gly Arg Ala Arg Leu Met Gin Ala Leu Pro Ser- Gly Gly Ala 
2435 2440 2445 



7344 



atg ttg gcg gtg get get ggt gag gag cag ctg egg ccg ttg ttg gee 
Met Leu Ala Val Ala Ala Gly Glu Glu Gin Leu Arg Pro Leu Leu Ala 
2450 2455 2460 



7392 



gat egg gtt gat ggt gcg ggt ate gee gcg gtc aac get cct gag teg 
Asp Arg Val Asp Gly Ala Gly lie Ala Ala Val Asn Ala Pro Glu Ser 
2465 2470 2475 2480 



7440 



9tg gtg etc tec ggc gat egg gag gtg ctt gac gac ate gee ggc gcg 
Val Val Leu Ser Gly Asp Arg Glu Val Leu Asp Asp lie Ala Gly Ala 
2485 2490 2495 



7488 



ctg gat ggg caa ggg att egg tgg egg egg ttg egg gtt teg cat gcg 
Leu Asp Gly Gin Gly lie Arg Trp Arg Arg Leu Arg Val Ser His Ala 
2500 2505 2510 



7536 



ttt cat teg tat egg atg gac ccg atg ttg cag gag ttc gee gaa ate 
Phe His Ser Tyr Arg Met Asp Pro Met Leu Gin Glu Phe Ala Glu He 
2515 2520 2525 



7584 



gca cgc age gtg gac tac egg cgt ggc gac eta ccg gtc gtg teg acg 
Ala Arg Ser Val Asp Tyr Arg Arg Gly Asp Leu Pro Val Val Ser Thr 
2530 2535 2540 



7632 



ttg acg ggt gag etc gac ace gca ggt gtg atg get acg ccg gag tat 

Leu Thr Gly Glu Leu Asp Thr Ala Gly Val Met Ala Thr Pro Glu Tyr 

2545 2550 2555" 2560 

tgg gtg cgt cag gtt cga gag ccc gtc cgc ttc gee gac ggc gtc egg 

Trp Val Arg Gin Val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Arg 

2565 2570 2575 



7680 



7728 
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gtg etc gcg cag caa ggg gtc gec acg ate ttc gaa etc ggc cct gat 
Val Leu Ala Gin Gin Gly Val Ala Thr He Phe Glu Leu Gly Pro Asp 
2580 2585 2590 



7776 



gcg accj dtg teg. gee ctg att ccc gat tgt cat teg tgg get gat cag 
Ala Thr Leu Ser Ala Leu He Pro Asp Cys His Ser Trp Ala Asp Gin 
2595 2600 2605 



7824 



gee atg ccg att ccg atg ctg cgt aaa gac cgt acg gaa ace gaa act 
Ala Met Pro He Pro Met Leu Arg Lys Asp Arg Thr Glu Thr Glu Thr 
2610 2615 2620 



7872 



gtg gtc gee gcg gtg gcg egg gcg cac acg cgt ggt gtt ccg gtc gaa 
Val Val Ala Ala Val Ala Arg Ala His Thr Arg Gly Val Pro Val Glu 
2625 2630 2635 2640 



7920 



tgg teg gcg tat ttc gee ggc ace ggg gca egg egg gtc gag ttg ccg 
Trp Ser Ala Tyr Phe Ala Gly Thr Gly Ala Arg Arg Val Glu Leu Pro 
2645 . 2650 2655 



7968 



acg tat gee ttc cag egg cag egg tac tgg ctg gaa aca teg gat tac 
Thr Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Glu Thr Ser Asp Tyr 
2660 2665 2670 



8016 



ggc gat gtg acg ggt ate ggc ctg get gcg gcg gag cat ccg ttg ctg 
Gly Asp Val Thr Gly He Gly Leu Ala Ala Ala Glu His Pro Leu Leu 
2675 2680 2685 



8064 



ggg gee gtg gtt gcg ctg gee gat ggt gat ggg atg gtg ctg ace ggc 
Gly Ala Val Val Ala Leu Ala Asp Gly Asp Gly Met Val Leu Thr Gly 
2690 2695 2700 



8112 



egg ttg teg gtg ggg acg cat ccg tgg ctg gee cag cat cgc gtg ctg 
Arg Leu Ser Val Gly Thr His Pro Trp Leu Ala Gin His Arg Val Leu 
2705 2710 2715 2720 



8160 



ggc gag gtc gtc gtc ccc ggc ace gee ate ctg gag atg gee ctg cac 
Gly Glu Val Val Val Pro Gly Thr Ala He Leu Glu Met Ala Leu His 
2725 2730 273-5 



8208 



gca ggg gcg cgt etc ggc tgt gac egg gtg gaa gag etc acc ctg gaa 
Ala Gly Ala Arg Leu Gly Cys Asp Arg Val Glu Glu Leu Thr Leu Glu 
2740 2745 2750 



8256 



aca ccg ctg gtg gtc ccc gaa cgc gcg gcg ggt gee ggt agt cgt ggc 
Thr Pro Leu Val Val Pro Glu Arg Ala Ala Gly Ala Gly Ser Arg Gly 
2755 2760 2765 



8304 
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cct gcg gga ggg acc aca gtt tea att gaa act gcg gaa gaa cgt gtg 8352 
Pro Ala Gly Gly Thr Thr Val Ser He Glu Thr Ala Glu Glu Arg Val 
2770 2775 2780 



egg acg aac gac gec ate gaa ate cag ctg ctg gtg aac gca ccc gac 64 00 
Arg Thr Asn Asp Ala lie Glu He Gin Leu Leu Val Asn Ala Pro Asp 
2785 2790 2795 2800 



gaa ggc ggt egg cga agg gtg teg ctg tat tec cgc ccg gee ggt ggg 8448 

Glu Gly Gly Arg Arg Arg Val Ser Leu Tyr Ser Arg Pro Ala Gly Gly 
2805 2810 2815 

teg aga ggt ggg ggt tgg acg cgc cac gee acc ggc gaa etc gtc gtc 84 96 

Ser Arg Gly Gly Gly Trp Thr Arg His Ala Thr Gly Glu Leu Val Val 
2820 2825 2830 



ggc acc acc ggt ggt agg gcg gtt cct gat tgg teg get gag ggt gee 8544 
• Gly Thr Thr Gly Gly Arg Ala Val Pro Asp Trp Ser Ala Glu Gly Ala 
2835 2840 2845 



gag teg att get etc gat gag ttc tac gtc get ctg gee gga aac ggg . 8592 
Glu Ser He Ala Leu Asp Glu Phe Tyr Val Ala Leu Ala Gly Asn Gly 
2850 2855 2860 

ttc gag tac ggg ccg ttg ttc cag ggg ctt cag gcg gca tgg cgt cgt 8640 
Phe Glu Tyr Gly Pro Leu Phe Gin Gly Leu Gin Ala Ala Trp Arg Arg 
2865 2870 2875 2880 



ggt gac gag gtt etc gee gaa ate gee ccg ccg gee gag gee gat gcg 8688 
Gly Asp Glu Val Leu Ala Glu He Ala Pro Pro Ala Glu Ala Asp Ala 
2885 2890 2895 



atg gcg teg gga tac ctg etc gac cca gcg ttg ctg gat gee gcg ctg 8736 
Met Ala Ser Gly Tyr Leu Leu Asp Pro Ala Leu Leu Asp Ala Ala Leu 
2900 2905 2910 

cag gcg tec gcg etc ggc gac cgc ccg gag caa ggc. ggc gcg tgg ctg 8784 
Gin Ala Ser Ala Leu Gly Asp Arg Pro Glu Gin Gly Gly Ala Trp Leu 
2915 2920 2925 

ccg ttc tea ttc acc ggc gtc gaa ctt tec get ccg gca ggg acg ate 8832 
Pro Phe Ser Phe Thr Gly Val Glu Leu Ser Ala Pro Ala Gly Thr He 
2930 2935 2940 

age agg gtg egg ctg gag acc agg cga ccc gac gcg ata teg gtg gee 8880 
Ser Arg Val Arg Leu Glu Thr Arg Arg Pro Asp Ala He Ser Val Ala 
2945 2950 2955 2960 
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gtg atg gat gag agt ggg egg ttg etc gec teg ate gat tct etc agg 
Val Met Asp Glu Ser Gly Arg Leu Leu Ala Ser He Asp Ser Leu Arg 
2965 2970 2975 



eta cga age gtg teg teg gga cag ctg gcg aat~ egg gac get gtc cgc 
Leu Arg Ser Val Ser Ser Gly Gin Leu Ala Asn Arg Asp Ala Val Arg 
2980 2985 2990 



gac gcg ctg ttc gag gtg ace tgg gag ccg gtg gcg acg cag teg acg 
Asp Ala Leu Phe Glu Val Thr Trp Glu Pro Val Ala Thr Gin Ser Thr 
2995 3000 3005 

gaa ccg ggt cgc tgg gee ctg ctt ggt gat act gee tgc ggt aaa gac 
Glu Pro Gly Arg Trp Ala Leu Leu Gly Asp Thr Ala Cys Gly Lys Asp 
3010 3015 3020 



gat etc ate aaa etc gca acg gat tec gee gac cgc tgc gcg gat ctg 
Asp Leu lie Lys Leu Ala Thr Asp Ser Ala Asp Arg Cys Ala Asp Leu 
3025 3030 3035 3040 

gcg gcg eta gee gag aaa ctt gat tec age gcg ctg gtt cct gat gtc 
Ala Ala Leu Ala Glu Lys Leu Asp Ser Ser Ala Leu Val Pro Asp Val 
3045 3050 3055 

gtg gtc tac tgc gee gga gaa cag gcg gat ccc ggc ace ggc gca gec 
Val Val Tyr Cys Ala Gly Glu Gin Ala Asp Pro Gly Thr Gly Ala Ala 
3060 3065 3070 

gca ctt gcg gag ace cag cag acg ttg get ctg etc caa gcg tgg ttg 
Ala Leu Ala Glu Thr Gin Gin Thr Leu Ala Leu Leu Gin Ala Trp Leu 
3075 3080 3085 

get gag ccg egg ttg gee gag gca cgt ctg gtg gtg gtg acg tgt gca 
Ala Glu Pro Arg Leu Ala Glu Ala Arg Leu Val Val Val Thr Cys Ala 
3090 3095 3100 



gcg gtg acg acg get ccg agt gac 
Ala Val Thr Thr Ala Pro Ser Asp 
3105 3110 



ggt gca tea gag ctg gca cat gcg 
Gly Ala Ser Glu Leu Ala His Ala 
3115 3120 



ccg ttg tgg ggg ttg ttg cgt gee gcg cag gtg gag aac ccg ggg cag 
Pro Leu Trp Gly Leu Leu Arg Ala Ala Gin Val Glu Asn Pro Gly Gin 
3125 3130 3135 

ttt gtg ctg gcg gac gtc gac gga ace gec gaa teg tgg cgt gcg ttg 
Phe Val Leu Ala Asp Val Asp Gly Thr Ala Glu Ser Trp Arg Ala Leu 
3140 3145 3150 



8928 



8976 



9024 



9072 



9120 



9168 



9216 



9264 



9312 



9360 



9408 



9456 
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ccg agt gcg ttg ggc teg atg gaa ccg cag ttg gec ctg egg aag ggc 9504 

Pro Ser Ala Leu Gly Ser Met Glu Pro Gin Leu Ala Leu Arg Lys Gly 
3155 3160 3165 

gcg gtg cga gcg ccc* cgc ttg get teg gtc gec ggg cag ate gac gtg 9552 

Ala Val Arg Ala Pro Arg Leu Ala Ser Val Ala Gly Gin He Asp Val 
3170 3175 3180 



ccc gcg gtt gtg gcg gat ccc gac cga acc gtg ctg att teg ggc ggc 9600 
Pro Ala Val Val Ala Asp Pro Asp Arg Thr Val Leu He Ser Gly Gly 
3185 3190 3195 3200 

acg ggc ctg ttg ggg ggc gcg gtt gee cgc cac ctg gtg acc gaa cgc 9648 
Thr Gly Leu Leu Gly Gly Ala Val Ala Arg His Leu Val Thr Glu Arg 
3205 3210 3215 

ggt gtc cgc cga ttg gtg ttg acg ggc cgt cgt ggc tgg gat get cct 9696 
Gly Val Arg Arg Leu Val Leu Thr Gly Arg Arg Gly Trp Asp Ala Pro 
3220 3225 3230 



gga ate acc gag ttg gtg ggt gag ctg aac ggc etc ggt gee gtg gtc 9744 
Gly He Thr Glu Leu Val Gly Glu Leu Asn Gly Leu Gly Ala Val Val 
3235 3240 3245 

gac gtg gtg gcg tgc gac gtc gcg gat cgt get gat ctg gag teg ttg 9792 
Asp Val Val Ala Cys Asp Val Ala Asp Arg Ala Asp Leu Glu Ser Leu 
3250 3255 3260 



ctg gcg gcg gtc ccg gcg gaa ttt ccg ttg tgc ggc gtg gtg cat gee 984 0 
Leu Ala Ala Val Pro Ala Glu Phe Pro Leu Cys Gly Val Val His Ala 
3265 3270 3275 3280 

gcg ggg gcg ctg gee gac ggg gtg ate gag teg ttg tea ccg gac gac 9888 
Ala Gly Ala Leu Ala Asp Gly Val He Glu Ser Leu Ser Pro Asp Asp 
3285 3290 3295 

gtg gga gcg gtg ttc ggc ccg aag gcg gcg ggg gcg tgg aat ctg cac 9936 
Val Gly Ala Val Phe Gly Pro Lys Ala Ala Gly Ala Trp Asn Leu His 
3300 3305 3310 



gag ctg act cgt gat acg gac ctg teg ttc ttc gcg ttg ttc tec teg 9984 
Glu Leu Thr Arg Asp Thr Asp Leu Ser Phe Phe Ala Leu Phe Ser Ser 
3315 . 3320 3325 

ctt tec ggt gtt gee ggc get cct ggt cag ggc aat tat gcg gcg gcg 10032 
Leu Ser Gly Val Ala Gly Ala Pro Gly Gin Gly Asn Tyr Ala Ala Ala 
3330 -3335 3340 
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aac gcg ttc ctg gac gca ttg gcg cat tac egg egg tea cag gga ctg 
Asn Ala Phe Leu Asp Ala Leu Ala His Tyr Arg Arg Ser Gin Gly Leu 
3345 3350 3355 3360 



10080 



cct gcg gtg teg ctg gec tgg ggc ctg tgg gag cag ccg age ggg atg 
Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Glu Gin Pro Ser Gly Met 
3365 3370 3375 



10128 



acg gag acg etc age gag gtc gac egg age agg ate gcg cgc gee aac 
Thr Glu Thr Leu Ser Glu Val Asp Arg Ser Arg lie Ala Arg Ala Asn 
3380 3385 3390 



10176 



ccg ccg ttg tec acc aag gag gga ttg egg ctg ttc gat gee ggg ctg 
Pro Pro Leu Ser Thr Lys Glu Gly Leu Arg Leu Phe Asp Ala Gly Leu 
3395 3400 3405 



10224 



gcg ctg gac egg gca gcg gta gtt ccg gcg aag ttg gac agg act ttc 10272 
Ala Leu Asp Arg Ala Ala Val Val Pro Ala Lys Leu Asp Arg Thr Phe 
3410 3415 3420 



ctg gee gag cag gcg egg teg ggc teg ctg ccc gca ttg ttg acg gca 10320 
Leu Ala Glu Gin Ala Arg Ser Gly Ser Leu Pro Ala Leu Leu Thr Ala 
3425 3430 3435 3440 



ctg gta ccc ccc ate cgt cgt aat agg egg get age gga acc gag etc 10368 
Leu Val Pro Pro lie Arg Arg Asn Arg Arg Ala Ser Gly Thr Glu Leu 
3445 3450 3455 



gcg gac gag ggc acc ctg etc ggg gtg gtg egg gag cat gee gcg gee 10416 
Ala Asp Glu Gly Thr Leu Leu Gly Val Val Arg Glu His Ala Ala Ala 
3460 3465 3470 



gtg ctg ggg tat teg age gcg get gac gtc ggg gtc gag cgc get ttc 104 64 
Val Leu Gly Tyr Ser Ser Ala Ala Asp Val Gly Val Glu Arg Ala Phe 
3475 3480 3485 



egg gat ctg ggt ttt gat teg ttg tct ggt gtg gag ttg egg aac cgc 10512 
Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly Val Glu Leu Arg Asn Arg 
3490 3495 3500 

ctt gee ggg gtg ctg ggg gtg egg ttg ccg gcg act gcg gtg ttc gac 10560 
Leu Ala Gly Val Leu Gly Val Arg Leu Pro Ala Thr Ala Val Phe Asp 
3505 3510 3515 3520 

tat ccg acg ccg agg gcg ctg gee egg ttc ctg cac cag gaa ctg gca 10608 
Tyr Pro Thr Pro Arg Ala Leu Ala Arg Phe Leu His Gin Glu Leu Ala 
3525 3530 3535 
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gac gag ate get acg acg cca gcg ccg gtg acg acg ace agg gca ccg 10656 

Asp Glu He Ala Thr Thr Pro Ala Pro Val Thr Thr Thr Arg Ala Pro 
3540 3545 3550 

gtc gee gaa gac gat etc gtc gcg ata gtc ggg atg gga tgc cgt ttt 10704 

Val Ala Glu Asp Asp Leu Val Ala He Val Gly Met Gly Cys Arg Phe 
3555 3560 3565 

ccc ggt cag gtg tec teg ccg gag gag etc tgg cgt ttg gtg gec ggg 10752 

Pro Gly Gin Val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala Gly 
3570 3575 3580 

ggc gtg gat gcg gtc gcg gac ttc cca gee gat cgc ggc tgg gat ctg 10800 

Gly Val Asp Ala Val Ala Asp Phe Pro Ala Asp Arg Gly Trp Asp Leu 
3585 3590 3595 3600 

gca ggc ttg ttc gat. ccg gac ccg gaa egg get ggg aag acc tac gtg 1084 8 

Ala Gly Leu Phe Asp Pro Asp Pro Glu Arg Ala Gly Lys Thr Tyr Val 

3605 3610 3615 

egg gaa ggg gee ttc etc acc gac gee gat egg ttc gat gcg ggt ttc 10896 

Arg Glu Gly Ala Phe Leu Thr Asp Ala Asp Arg Phe Asp Ala Gly Phe 
3620 3625 3630 

ttc ggg att tec ccg cgt gag gcg ttg gcg atg gat ccg cag caa egg 10944 

Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg ■ 
3635 3640 3645 

c.tg ttg ctg gag ctg tec tgg gag gee att gaa egg gca ggg ate gat 10992 

Leu Leu Leu Glu Leu Ser Trp Glu Ala He Glu Arg Ala Gly He Asp 
3650 3655 3660 

ccg ggt teg ctg agg ggg agt egg acc ggt gtg ttc gcg ggg ctg atg 11040 

Pro Gly Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met 
3665 3670 3675 3680 

tac cac gac tat ggc gec egg ttc gee age cga gee ccg gaa ggt ttc 11088 

Tyr His Asp Tyr Gly Ala Arg Phe Ala Ser Arg Ala Pro Glu Gly Phe 

3685 3690 3695 

gag ggg tat etc ggc aat ggc agt get ggg agt gtc gcg teg ggc egg 1113 6 

Glu Gly Tyr Leu Gly Asn Gly Ser Ala .Gly Ser Val Ala Ser Gly Arg 
3700 3705. 3710 

att gcg tac teg ttt ggt ttc gag ggt cct gcg gtg acg gtg gat act 11184 

He Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr 
3715 3720 3725 
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gcg tgt teg teg teg ttg gtg gcg ttg cat ttg gcg ggt cag teg ttg 11232 

Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Gly Gin Ser Leu 
3730 3735 3740 



cgt tec ggc gaa tgc gat etc gee ctt gee ggt ggt gtg acg- gtg atg 112 80 
Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met 
3745 3750 3755 3760 

teg acg ccc ggg acg ttt gtg gaa ttc tec cgt cag egg ggc ctg gca 11328 
Ser Thr Pro Gly Thr Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala 
3765 3770 3775 

ccg gac ggg egg tgc aag teg ttc gcg gag age gcg gac ggt acc ggt 11376 
Pro Asp Gly Arg Cys Lys. Ser Phe Ala Glu Ser Ala Asp Gly Thr Gly 
3780 3785 3790 

tgg ggt gag ggt get ggt ttg gtg ttg ttg gag egg ttg teg gat get 11424 
Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala 
3795 3800 3805 

egg egg aat ggg cat egg gtg ttg gcg gtg gtt cgt ggg teg gcg gtg 11472 
Arg Arg Asn Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val 
3810 3815 3820 

aat cag gat ggt gcg teg aat ggc ttg acc gcg ccg aat ggt ccc teg 11520 
Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser 
3825 3830 3835 3840 

cag cag egg gtc ate cag cag gcg ttg gcg agt gcg ggt ctg teg gtg 11568 
Gin Gin Arg Val lie Gin Gin Ala Leu Ala Ser Ala Gly Leu Ser Val 
3845 3850 3855 

tec gat gtg gat gee gtg gag gcg cat ggg acc ggg acc agg ttg ggt 11616 
Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
3860 3865 3870 

gat ccg att gag gcg cag get ctg att get acg tat ggg cgc gat cgt 11664 
Asp Pro He Glu Ala Gin Ala Leu He Ala Thr Tyr Gly Arg Asp Arg 
3875 3880 3885 



gat ccc ggt egg ccg ttg tgg ttg ggg 
Asp Pro Gly Arg Pro Leu Trp Leu Gly 
3890 3895 

cat acg cag gcg gcg gcg ggt gtt gee 
His Thr Gin Ala Ala Ala Gly Val Ala 
3905 3910 



teg gtg aag tec aac ate ggt 11712 
Ser Val Lys Ser Asn He Gly 
3900 

ggt gtg ate aag atg gtg atg 11760 
Gly Val He Lys Met Val Met 
3915 3920 
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gcg atg egg cac ggg caa ctt ccg cgc acg ctg cac gtg gat gca ccc 11808 

Ala Met Arg His Gly Gin Leu Pro Arg Thr Leu His Val Asp Ala Pro 

3925 3930 3935 



tec teg cag gtg gat tgg teg gcg ggg agg gtc cag etc ctg acg gag 11856 

Ser Ser Gin Val Asp Trp Ser Ala Gly Arg Val Gin Leu Leu Thr Glu 
3940 3945 3950 

aac acg ccc tgg ccc gac agt ggt cgc ccc tgt egg gtg ggg gtg teg 11904 

Asn Thr Pro Trp Pro Asp Ser Gly Arg Pro Cys Arg Val Gly Val Ser 
3955 3960 3965 



teg ttc ggg ate age ggc ace aac gcg 
Ser Phe Gly lie Ser Gly Thr Asn Ala 
3970 3975 

acg ggg cag atg gat cag gca gcg gag 
Thr Gly Gin Met Asp Gin Ala Ala Glu 
3985 3990 

gat gtt ccg gtg gtg ccg tgg gtg gtg 
Asp Val Pro Val Val Pro Trp Val Val 
4005 



cac gtc ate ctg gaa cag tec 11952 
His Val lie Leu Glu Gin Ser 
3980 

ccg gat teg agt cct gtt ctg 12000 
Pro Asp Ser Ser Pro Val Leu 
3995 4000 

teg ggc aaa aca ccc gaa gcg 12048 
Ser Gly Lys Thr Pro Glu Ala 
010 4015 



eta tec gee cag gcg gca acg ttg gcg acc tat ttg gac caa aat gtt 12096 
Leu Ser Ala Gin Ala Ala Thr Leu Ala Thr Tyr Leu Asp Gin Asn Val 
4020 4025 4030 



gat gtc tec cct ctg gac gtt ggg att teg ctt gcg gtg acc cgt teg 12144 

Asp Val Ser Pro Leu Asp Val Gly lie Ser Leu Ala Val Thr Arg Ser 
4035 4040 4045 

gcg ctg gat gag egg gcg gtg gtg ctg ggg teg gat cgt gac acg ttg 12192 

Ala Leu Asp Glu Arg Ala Val Val Leu Gly Ser Asp Arg Asp Thr Leu 
4050 4055 4060 



ttg tct ggc ctg aat gcg ctg get 
Leu Ser Gly Leu Asn Ala Leu Ala 
4065 4070 

gtt acg gga cct gtc ggg att ggt 
Val Thr Gly Pro Val Gly lie Gly 
4085 



gee ggt cat gag get get ggc gtg 12240 
Ala Gly His Glu Ala Ala Gly Val 
4075 ■ 4080 

ggc egg acc ggg ttt gtg ttc gee 12288 
Gly Arg Thr Gly Phe Val Phe Ala 
4090 4095 



ggt .caa ggc ggt cag tgg ttg ggg atg ggc cgc egg ttg tac teg gag 12336 
Gly Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Arg Leu Tyr Ser Glu 
4100 4105 4110 
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ttt ccg gcg ttc gcc ggt get ttc gac gaa gca tgc gec gag etc gat 12384 
Phe Pro Ala Phe Ala Gly Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp 
4115 4120 4125 

gcg aac ctg ggg agg gaa gtc ggg gtt egg gat gtg gtg ttc ggc tec 124 32 
Ala Asn Leu Gly Arg Glu Val Gly Val Arg Asp Val Val Phe Gly Ser 
4130 4135 4140 

gac gag tec ttg ctg gat egg act ttg tgg gcg cag teg ggt ttg ttc 12480 
Asp Glu Ser Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser Gly Leu Phe 
4145 4150 4155 4160 

gcg ttg cag gtc ggt etc tgg gaa ttg ttg ggt acg tgg ggt gtt egg 12528 
Ala Leu Gin Val Gly Leu Trp Glu Leu Leu Gly Thr Trp Gly Val Arg 
4165 4170 4175 

ccc age gta gtg ctg ggg cat teg gtc ggg gag eta gcc gcg gcg ttc 12576 
Pro Ser Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala Phe 
4180 4185 4190 

gcc gca ggt gtg ctg teg atg gcg gag gcg get egg ctg gtg gcg ggt 12624 
Ala Ala Gly Val Leu Ser Met Ala Glu Ala Ala Arg Leu Val Ala Gly 
4195 4200 4205 

cgt gcg egg ttg atg cag gcg ttg cct tct ggc ggt gcc atg ctg gcg 12672 
Arg Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala Met Leu Ala 
4210 4215 4220 

gtg tec gcg ace gag gcc cga gtc ggc ccg ctg etc gat ggg gtg egg 12720 
Val Ser Ala Thr Glu Ala Arg Val Gly Pro Leu Leu Asp Gly Val Arg 
4225 4230 4235 4240 

gat cgt gtt ggt gtc gca gcg gtt aac get ccg ggg teg gtg gtg ctt 12768 
Asp Arg Val Gly Val Ala Ala Val Asn Ala Pro Gly Ser Val Val Leu 
4245 4250 4255 

tec ggt gac egg gat gtg etc gat ggc att gcc ggt egg ctg gac ggg 12816 
Ser Gly Asp Arg Asp Val Leu Asp Gly lie Ala Gly Arg Leu Asp Gly 
4260 4265 4270 

caa ggt ate egg teg agg tgg ttg egg gtt teg cac gcg ttt cat teg 12864 
Gin Gly He Arg Ser Arg Trp Leu Arg Val Ser His Ala Phe His Ser 
4275 4280 4285 

cat egg atg gat ccg atg ctg gcg gag ttc gcc gag etc gca egg age 12912 
His Arg Met Asp Pro Met Leu Ala Glu Phe Ala Glu Leu Ala Arg Ser 
4290 4295 4300 
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gtg gac tac egg tct cca egg ctg ccg att gtc teg acg ctg ace gga 12960 
Val Asp Tyr Arg Ser Pro Arg Leu Pro He Val Ser Thr Leu Thr Gly 
4305 4310 4315 4320 

aac etc gat gac gtg ggc gtg atg get acg ccg gag tat tgg gtg cgc 13008 
Asn Leu Asp Asp Val Gly Val Met Ala Thr Pro Glu Tyr Trp Val Arg 
4325 4330 4335 

cag gtg cga gag ccc gtc cgc ttc gee gac ggt gtc cag gcg ctt gtg 13056 
Gin Val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Gin Ala Leu Val 
4340 4345 4350 

gac caa ggc gtc gac acg att gtg gaa etc ggt ccg gac ggg gcg ttg 13104 
Asp Gin Gly Val Asp Thr He Val Glu Leu Gly Pro Asp Gly Ala Leu 
4355 4360 4365 

teg age ttg gtt caa gag tgt gtg gcg gag tec ggg egg gcg acg ggg 13152 
Ser Ser Leu Val Gin Glu Cys Val Ala Glu Ser Gly Arg Ala Thr Gly 
4370 4375 4380 

att ccg ttg gtg egg aga gac cgt gat gag gtc cga acg gtg ctg gac 13200 
He Pro Leu Val Arg Arg Asp Arg Asp Glu Val Arg Thr Val Leu Asp 
4385 4390 4395 4400 

get ttg gcg cag ace cac act cgt ggt ggc gcg gtg gac tgg ggg tea 13248 
Ala Leu Ala Gin Thr His Thr Arg Gly Gly Ala Val Asp Trp Gly Ser 
4405 4410 4415 

ttt ttc get ggt acg agg gca acg caa gtc gac ctt ccc acg tat gee 13296 
Phe Phe Ala Gly Thr Arg Ala Thr Gin Val Asp Leu Pro Thr Tyr Ala 
4420 4425 4430 

ttc caa cga cag egg tac tgg ctg gag cca teg gat tec ggt gat gtg 13 344 
Phe Gin Arg Gin Arg Tyr Trp Leu Glu Pro Ser Asp Ser Gly Asp Val 
4435 4440 4445 

acc ggt gtt ggc ctg ace ggg gcg gag cat ccg ctg ttg ggt gec gtg 13392 
Thr Gly Val Gly Leu Thr Gly Ala Glu His Pro Leu Leu Gly Ala Val 
4450 4455 4460 

gtg ccg gtc gcg ggc ggc gat gag gtg ctg ctg acc ggc agg ctg teg 1344 0 
Val Pro Val Ala Gly Gly Asp Glu Val Leu Leu Thr Gly Arg Leu Ser 
4465 4470 4475 4480 

gtg ggg acg cat ccg tgg ctg gcg gaa cac cgc gtg ctg ggc gaa gtc 13488 
Val Gly Thr His Pro Trp Leu Ala Glu His Arg Val Leu Gly Glu Val 
4485 4490 4495 
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gtc gtc ccc ggc acc gcg ttg ctg gag atg gcg tgg egg gec ggt age 
Val Val Pro Gly Thr Ala Leu Leu Glu Met Ala Trp Arg Ala Gly Ser 
4500 4505 4510 



13536 



cag gtc ggt tgt gaa cgt gtg gag gag etc acc ttg gag gca ccg ctg 
Gin Val Gly Cys Glu Arg Val Glu Glu Leu Thr Leu Glu Ala Pro Leu 
4515 4520 4525 



13584 



gtc ctg ccg gag egg ggc get gcg gcg gtg cag ttg gcg gtg ggg get 
Val Leu Pro Glu Arg Gly Ala Ala Ala Val Gin Leu Ala Val Gly Ala 
4530 4535 4540 



13632 



ccg gat gag gee ggc egg cgc agt ttg cag etc tat tec cga ggc get 
Pro Asp Glu Ala Gly Arg Arg Ser Leu Gin Leu Tyr Ser Arg Gly Ala 
4545 4550 4555 4560 



13680 



gat gaa gac ggc gac tgg egg egg att gee tec ggg ctg ttg gec cag 
Asp Glu Asp Gly Asp Trp Arg Arg He Ala Ser Gly Leu Leu Ala Gin 
4565 4570 4575 



13728 



gec aat gcg gtg ccg ccg gcg gat teg acg gca tgg ccg ccg gac ggc 
Ala Asn Ala Val Pro Pro Ala Asp Ser Thr Ala Trp Pro Pro Asp Gly 
4580 4585 4590 



13776 



gec ggg cag gtc gat ctg gcg gag .ttc tac gag cgc etc gec gag cgc 
Ala Gly Gin Val Asp Leu Ala Glu Phe Tyr Glu Arg Leu Ala Glu Arg 
4595 4600 4605 



13824 



ggc ttg acc tac ggt ccg gta ttc caa ggg etc cgc gec gca tgg egg 
Gly Leu Thr Tyr Gly Pro Val Phe Gin Gly Leu Arg Ala Ala Trp Arg 
4610 4615 4620 



13872 



cac ggc gac gat ate ttc gee gaa ttg gee ggg tea cca gac gec teg 
His Gly Asp Asp He Phe Ala Glu Leu Ala Gly Ser Pro Asp Ala Ser 
4625 4630 4635 4640 



13920 



ggt ttc ggc ate cac ccg gcg ctg ctg gac get gca ctg cac gcg atg 
Gly Phe Gly He His Pro Ala Leu Leu Asp Ala Ala Leu His Ala Met 
4645 4650 4655 



13968 



gcg ctt ggt get teg ccc gac teg gaa gcg cgt ctg ccg ttt tec tgg 
Ala Leu Gly Ala Ser Pro Asp Ser Glu Ala Arg Leu Pro Phe Ser Trp 
4660 4665 4670 



14016 



cgt ggc gee cag ctg tac cgc get gaa gga gca gcg ctt egg gta egg 
Arg Gly Ala Gin Leu Tyr Arg Ala Glu Gly Ala Ala Leu Arg Val Arg 
4675 4680 4685 



14064 
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etc teg ccg ctg ggc tec ggt gca gtc tea ttg acg ttg gtg gat gec 
Leu Ser Pro Leu Gly Ser Gly Ala Val Ser Leu Thr Leu Val Asp Ala 
4690 4695 4700 



14112 



aca ggg cga cga gtc get gcg gtg gaa teg ctt teg acg cga ccg gtc 
Thr Gly Arg Arg Val Ala Ala Val Glu Ser Leu Ser Thr Arg Pro Val 
4705 4710 4715 4720 



14160 



tec acc gac cag ate ggt gec ggt cgc ggc gat caa gag egg ctg ctg 
Ser Thr Asp Gin He Gly Ala Gly Arg Gly Asp Gin Glu Arg Leu Leu 
4725 4730 4735 



14208 



cac gtc gag tgg gta agg teg get gaa tct gcg ggg atg tct ctg acc 
His Val Glu Trp Val Arg Ser Ala Glu Ser Ala Gly Met Ser Leu Thr 
4740 4745 4750 



14256 



tec tgc gcg gtg gtc ggt ttg ggc gaa ccg gag tgg cac get gcg ctg 
Ser Cys Ala Val Val Gly Leu Gly Glu Pro Glu Trp His Ala Ala Leu 
4755 4760 4765 



14304 



aag acc act ggt gtc caa gtc gag tec cat gcg gac ctt get teg ttg 
Lys Thr Thr Gly Val Gin Val Glu Ser His Ala Asp Leu Ala Ser Leu 
4770 4775 4780 



14352 



gee acc gag gtt gee aag egg ggt tea get cct ggt gcg gtc ate gtc 
Ala Thr Glu Val Ala Lys Arg Gly Ser Ala Pro Gly Ala Val He Val 
4785 4790 4795 4800 



14400 



ccg tgc ccg cga ccc cga gcg atg cag gag ctg ccg acc gee gcg cga 
Pro Cys Pro Arg Pro Arg Ala Met Gin Glu Leu Pro Thr Ala Ala Arg 
4805 4810 4815 



14448 



agg gcg acg caa cag gcg atg gcg atg ctg cag caa tgg ctt gee gat 
Arg Ala Thr Gin Gin Ala Met Ala Met Leu Gin Gin Trp Leu Ala Asp 
4820 4825 4830 



14496 



gac egg ttc gtc agt acg cgc ctg ate 
Asp Arg Phe Val Ser Thr Arg Leu He 
4835 4840 

tec gca gtt get gga gaa gac gtg etc 
Ser Ala Val Ala Gly Glu Asp Val Leu 
4850 4855 

t99 99 c fct 9 9 tc C 9 C a 9 c 9 C 9 caa 9 C 9 
Trp Gly Leu Val Arg Ser Ala Gin Ala 
4865 4870 



ctg ctg acg cat egg gcg gtc 14544 

Leu Leu Thr His Arg Ala Val 
4845 

gac ctg gta cac gcg ccg ctg 14592 

Asp Leu Val His Ala Pro Leu 
4860 

gag cac ccg gac cga ttc gee 14640 

Glu His Pro Asp Arg Phe Ala 
4875 4880 
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ttg ate gat atg gac gac gag cga gca teg cag acg gca etc gec gaa 
Leu lie Asp Met Asp Asp Glu Arg Ala Ser Gin Thr Ala Leu Ala Glu 
4885 4890 4895 



14688 



gcg ctg act gcg gga gaa gcg cag etc gcg gtg egg teg gga gtt gtg 
Ala Leu Thr Ala Gly Glu Ala Gin Leu Ala Val Arg Ser Gly Val Val 
4900 4905 4910 



14736 



ctg gcg ccc cgc etc ggc cag gtg aag gtg agt gga ggt gaa gcg ttc 
Leu Ala Pro Arg Leu Gly Gin Val Lys Val Ser Gly Gly Glu Ala Phe 
4915 4920 4925 



14784 



agg tgg gat gaa ggc ace gtg ctg gtc acc ggc gga ace ggc ggg etc 
Arg Trp Asp Glu Gly Thr Val Leu Val Thr Gly Gly Thr Gly Gly Leu 
4930 4935 4940 



14832 



ggg gec ctg etc gca cgc cat ctg gtc age gee cac ggt gtg egg cac 
Gly Ala Leu Leu Ala Arg His Leu Val Ser Ala His Gly Val Arg His 
4945 4950 4955 4960 



14880 



ctg ttg etc gca agt cgc cgt ggt ctg gcg gcg ccc gga gcg gat gag 
Leu Leu Leu Ala Ser Arg Arg Gly Leu Ala Ala Pro Gly Ala Asp Glu 
4965 4970 4975 



14928 



ctg gtg gec gag ctg gag cag gee ggc gec gac gtc gcg gtc gtc gcg 
Leu Val Ala Glu Leu Glu Gin Ala Gly Ala Asp Val Ala Val Val Ala 
4980 4985 4990 



14976 



tgc gac teg gca gat egg gac teg ctt gcg egg ctg gtg gcg teg gtg 
Cys Asp Ser Ala Asp Arg Asp Ser Leu Ala Arg Leu Val Ala Ser Val 
4995 5000 5005 



15024 



cct gcg gaa aac ccg ttg egg gtg gtg gtg cac gee gee ggt gtg ctg 
Pro Ala Glu Asn Pro Leu Arg Val Val Val His Ala Ala Gly Val Leu 
5010 5015 5020 



15072 



gat gac ggt gtg ctg atg teg atg teg ccg gag cgc ttg gac gcg gtg 
Asp Asp Gly Val Leu Met Ser Met Ser Pro Glu Arg Leu Asp Ala Val 
5025 5030 5035 5040 



15120 



ttg egg ccc aaa gtg gat gee gcg tgg tac ctg cac gag ctg act egg 
Leu Arg Pro Lys Val Asp Ala Ala Trp Tyr Leu His Glu Leu Thr Arg 
5045 5050 5055 



15168 



gaa etc ggt ctg teg gcg ttc gtg ttg ttc tec teg gtc gcg ggc ctg 
Glu Leu Gly Leu Ser Ala Phe Val Leu Phe Ser Ser Val Ala Gly Leu 
5060 5065 5070 



15216 
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ttc ggc ggt gcg ggg cag age aat tac get gee ggc aac get ttc ctg 
Phe Gly Gly Ala Gly Gin Ser Asn Tyr Ala Ala Gly Asn Ala Phe Leu 
5075 5080 5085 



15264 



gat gee ttg gcg cat tgc egg cag gec cag ggg ctg ccc gcg ctg teg 
Asp Ala Leu Ala His Cys Arg Gin Ala Gin Gly Leu Pro Ala Leu Ser 
5090 5095 5100 



15312 



ctg gee tec ggg ctg tgg gcg agt ate gat gga atg gcg ggc gac etc 
Leu Ala Ser Gly Leu Trp Ala Ser lie Asp Gly Met Ala Gly Asp Leu 
5105 5110 5115 5120 



15360 



get gcg gca gat gtg gag egg ctg teg egg gca ggc att ggc ccg ctt 
Ala Ala Ala Asp Val Glu Arg Leu Ser Arg Ala Gly lie Gly Pro Leu 
5125 5130 5135 



15408 



teg gca ccg gga ggg ctg gee ttg ttc gac get gee gtt ggc teg gac 
Ser Ala Pro Gly Gly Leu Ala Leu Phe Asp Ala Ala Val Gly Ser Asp 
5140 5145 5150 



15456 



gaa ccg ttg ctg gca ccg gtg cga ctg gat gtc gaa gca ctg cgt gtg 
Glu Pro Leu Leu Ala Pro Val Arg Leu Asp Val Glu Ala Leu Arg Val 
5155 5160 5165 



15504 



cag gee cga tec gtg cag ace egg att ccg gaa atg ctg cat ggc atg 
Gin Ala Arg Ser Val Gin Thr Arg lie Pro Glu Met Leu His Gly Met 
5170 5175 5180 



15552 



gca atg ggg cca age cgc cgc act ccg ttc act tec agg gtt gag ccg 
Ala Met Gly Pro Ser Arg Arg Thr Pro Phe Thr Ser Arg Val Glu Pro 
51B5 5190 5195 5200 



15600 



ttg cac gaa egg ctg gee gga ttg teg gag ggc gaa cgt egg cag caa 
Leu His Glu Arg Leu Ala Gly Leu Ser Glu Gly Glu Arg Arg Gin Gin 
5205 5210 5215 



15648 



gtg etc cag cgc gtc cgc gee gat ate gcg gtg gta ctg ggg cac ggc 
Val Leu Gin Arg Val Arg Ala Asp lie Ala Val Val Leu Gly His Gly 
5220 5225 5230 



15696 



agg teg age gat gtg gac ate gag aag cct ttg gee gag ctg ggt ttc 
Arg Ser Ser Asp Val Asp lie Glu Lys Pro Leu Ala Glu Leu Gly Phe 
5235 5240 5245 



15744 



gac teg ctg acg gee ate gaa etc cgc aac cgt etc get acc gee acc 
Asp Ser Leu Thr Ala lie Glu Leu Arg Asn Arg Leu Ala Thr Ala Thr 
5250 5255 5260 



15792 
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gga ctg egg ctt ccc gcg acg ctg gec ttc gac cac ggc act gcg gcg 15840 
Gly Leu Arg Leu Pro Ala Thr Leu Ala Phe Asp His Gly Thr Ala Ala 
5265 5270 5275 5280 

gca etc gec cag cac gtg tgc gcg cag eta ggc acc gcg acc gcg ccg 15888 
Ala Leu Ala Gin His Val Cys Ala Gin Leu Gly Thr Ala Thr Ala Pro 
5285 5290 5295 

gca ccg agg cga acc gac gac aac gac gec acg gag ccc gtg agg teg 15 936 
Ala Pro Arg Arg Thr Asp Asp Asn Asp Ala Thr Glu Pro Val Arg Ser 
5300 5305 5310 

etc ttc caa cag gcg tat gcg get ggc egg ata ctt gac ggg atg gat 15984 
Leu Phe Gin Gin Ala Tyr Ala Ala Gly Arg lie Leu Asp Gly Met Asp 
5315 5320 5325 

ttg gtg aag gtc get gee cag ttg cga ccg gtg ttc ggt teg cct ggc 16032 
Leu Val Lys Val Ala Ala Gin Leu Arg Pro Val Phe. Gly Ser Pro Gly 
5330 5335 5340 

gag ctg gaa tec ctg ccg aaa ccc gtc cag ctt tec cgt ggt ccc gaa 16080 
Glu Leu Glu Ser Leu Pro Lys Pro Val Gin Leu Ser Arg Gly Pro Glu 
5345 5350 5355 5360 

gag ctt gee ttg gtg tgc atg ccg gcg ctg ate ggg atg ccg ccc gca 16128 
Glu Leu Ala Leu Val Cys Met Pro Ala Leu He Gly Met Pro Pro Ala 
5365 5370 5375 

cag cag tac gcg egg ate gec gee ggg ttc cgc gat gtg egg gac gtt 16176 
Gin Gin Tyr Ala Arg He Ala Ala Gly Phe Arg Asp Val Arg Asp Val 
5380 5385 5390 

teg gtg ate ccg atg cct gga ttc att gcg gga gaa ccg ctg ccg tec 16224 
Ser Val He Pro Met Pro Gly Phe He Ala Gly Glu Pro Leu Pro Ser 
5395 5400 5405 

gee ate gag gtg gcg gtt egg acg cag gcg gag gcg gtg ctg cag gaa 16272 
Ala He Glu Val Ala Val Arg Thr Gin Ala Glu Ala Val Leu Gin Glu 
5410 5415 5420 

ttc gee ggg ggc teg ttc gta ctg gtc ggg cat tec tec ggg ggc tgg 16320 
Phe Ala Gly Gly Ser Phe Val Leu Val Gly His Ser Ser Gly Gly Trp 
5425 5430 5435 5440 

ctg gcg cac gag gta gee ggt gag ctg gag cgt cgc ggg gtc gtc ccg 16368 
Leu Ala His Glu Val Ala Gly Glu Leu Glu Arg Arg Gly Val Val Pro 
5445 5450 5455 
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gcc ggg gtc gta ctg ctg gac acc tac ate ccc ggt gag ate acg ccg 16416 
Ala Gly Val Val Leu Leu Asp Thr Tyr He Pro Gly Glu He Thr Pro 
5460 5465 5470 

agg ttc tec gtg gcg atg gcc cac egg acg tat gag aag etc gcg act 16464 
Arg Phe Ser Val Ala Met Ala His Arg Thr Tyr Glu Lys Leu Ala Thr 
5475 5480 5485 

ttc acg gac atg cag gat gtc ggt ate acc gcg atg ggc ggg tac ttc 16512 
Phe Thr Asp Met Gin Asp Val Gly He Thr Ala Met Gly Gly Tyr Phe 
5490 5495 5500 

egg atg ttc acc gag tgg act ccg acg ccg ate ggt get ccg acg ctg 16560 
Arg Met Phe Thr Glu Trp Thr Pro Thr Pro He Gly Ala Pro Thr Leu 
5505 5510 5515 5520 

ttc gtg egg acc gaa gat tgc gtc gca gac cct gaa ggg egg ccg tgg 16608 
Phe Val Arg Thr Glu Asp Cys Val Ala Asp Pro Glu Gly Arg Pro Trp 
5525 5530 5535 

aca gat gac tec tgg egg cca ggg tgg act etc gcg gat gcc acg gtc 16656 
Thr Asp Asp Ser Trp Arg Pro Gly Trp Thr Leu Ala Asp Ala Thr Val 
5540 5545 5550 

cag gtg ccg ggc gac cac ttc teg atg atg gac gag cac gcc ggg tec 16704 
Gin Val Pro Gly Asp His Phe Ser Met Met Asp Glu His Ala Gly Ser 
5555 5560 5565 

acc gca cag gca gtc gcg agt tgg ctt gac aaa etc aac cag cgc acc 16752 
Thr Ala Gin Ala Val Ala Ser Trp Leu Asp Lys Leu Asn Gin Arg Thr 
5570 5575 5580 

get egg caa cgc tga 16767 

Ala Arg Gin Arg 

5585 



<210> 50 
<211> 5588 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 50 

Met Ala Asn Glu Glu Lys Leu Arg Glu Tyr Leu Lys Arg Val Val Val 
15 10 15 

Glu Leu Glu Glu Ala His Glu Arg Leu His Glu Leu Glu Arg Gin Glu 
20 25 30 
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His Asp Pro lie Ala lie Val Ser Met Gly Cys Arg Tyr Pro Gly Gly 
35 40 45 

Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Val Val Asp Gly Gly Asp 
50 55 60 

Ala lie Ala Asn Phe Pro Glu Asp Arg Gly Trp Asn Leu Asp Glu Leu 
65 70 75 80 

Phe Asp Pro Asp Pro Gly Arg Ala Gly Thr Ser Tyr Val Arg Glu Gly 
85 90 95 

Gly Phe Leu Arg Gly Val Ala Asp Phe Asp Ala Gly Leu Phe Gly lie 
100 105 110 

Ser Pro Arg Glu Ala Gin Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

Glu lie Ser Trp Glu Val Phe Glu Arg Ala Gly lie Asp Pro Phe Ser 
130 135 140 

Leu Arg Gly Thr Lys Thr Gly Val Phe Ala Gly Leu lie Tyr His Asp 
145 150 155 160 

Tyr Ala Ser Arg Phe Arg Lys Thr Pro Ala Glu Phe Glu Gly Tyr Phe 
165 170 175 

Ala Thr Gly Asn Ala Gly Ser Val Ala Ser Gly Arg Val Ala Tyr Thr 
180 185 190 

Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser 
195 200 205 

Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Leu Gly Glu 
210 215 220 

Cys Asp Leu Ala Leu Ala Gly Gly He Ser Val Met Ala Thr Pro Gly 
225 230 235 240 

Ala Phe Val Glu Phe Ser Arg Gin Arg Ala Leu Ala Ser Asp Gly Arg 
245 250 255 

Cys Lys Pro Phe Ala Asp Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly 
260 265 270 



Ala Gly Met Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly 
275 280 285 



301 
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His Pro Val Leu Ala Ala Val Val Gly Ser Ala lie Asn Gin Asp Gly 
290 295 300 

Thr Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg Val 
305 310 315 320 

lie Arg Gin Ala Leu Ala Asn Ala Gly Leu Ser Pro Ala Glu Val Asp 
325 330 335 

Val Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu 
340 345. 350 

Ala Gin Ala Leu He Ala Thr Tyr Gly Ala Asn Arg Ser Ala Asp His 
355 360 365 

Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn He Gly His Thr Gin Ala 
370 375 , 380 

Ala Ala Gly Val Ala Gly Val He Lys Ser Val Leu Ala He Arg His 
385 390 395 400 

Arg Glu Met Pro Arg Ser Leu His He Asp Gin Pro Ser Gin His Val 
405 410 415 

Asp Trp Ser Ala Gly Ala Val Arg Leu Leu Thr Asp Ser Val Asp Trp 
420 425 430 

Pro Asp Leu Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Met 
435 440 445 

Ser Gly Thr Asn Ala His Leu He Val Glu Glu Val Ser Asp Glu Pro 
450 455 460 

Val Ser Gly Ser Thr Glu Pro Thr Gly Ala Phe Pro Trp Pro Leu Ser 
465 470 475 480 

Gly Lys Thr Glu Thr Ala Leu Arg Glu Gin Ala Ala Glu Leu Leu Ser 
485 490 495 

Val Val Thr Glu His Pro Glu Pro Gly Leu Gly Asp Val Gly Tyr Ser 
500 505 510 

Leu Ala Thr Gly Arg Ala Ala Met Glu His Arg Ala Val Val Val Ala 
515 520 525 

Asp Asp Arg Asp Ser Phe Val Ala Gly Leu Thr Ala Leu Ala Ala Gly 
530 535 540 
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Val Pro Ala Ala Asn Val Val Gin Gly Ala Ala Asp Cys Lys Gly Lys 
545 550 555 560 

Val Ala Phe Val Phe Pro Gly Gin Gly Ser His Trp Gin Gly Met Ala 
565 570 575 

Arg Glu Leu Ser Glu Ser Ser Pro Val Phe Arg Arg Lys Leu Ala Glu 
580 585 590 

Cys Ala Ala Ala Thr Ala Pro Tyr Val Asp Trp Ser Leu Leu Gly Val 
595 600 605 

Leu Arg Gly Asp Pro Asp Ala Pro Ala Leu Asp Arg Asp Asp Val lie 
610 615 620 

Gin Leu Ala Leu Phe Ala Met Met Val Ser Leu Ala Glu Leu Trp Arg 
625 630 635 640 

Ser Cys Gly Val Glu Pro Ala Ala Val Val Gly His Ser Gin Gly Glu 
645 650 655 

He Ala Ala Ala His Val Ala Gly Ala Leu Ser Leu Thr Asp Ala Val 
660 665 670 

Arg He lie Ala Ala Arg Cys Asp Ala Val Ser Ala Leu Thr Gly Lys 
675 680 685 

Gly Gly Met Leu Ala He Ala Leu Pro Glu Ser Ala Val Val Lys Arg 
690 695 700 

He Ala Gly Leu Pro Glu Leu Thr Val Ala Ala Val Asn Gly Pro Gly 
705 710 715 720 

Ser Thr Val Val Ser Gly Glu Pro Ser Ala Leu Glu Arg Leu Gin Thr 
725 730 735 

Glu Leu Thr Ala Glu Asn Val Gin Thr Arg Arg Val Gly He Asp Tyr 
740 745 750 

Ala Ser His Ser Pro Gin He Ala Gin Val Gin Gly Arg Leu Leu Asp 
755 760 765 

Arg Leu Gly Glu Val Gly Ser Glu Pro Ala Glu He Ala Phe Tyr Ser 
770 775 780 



Thr Val Thr Gly Glu Arg Thr Asp Thr Gly Arg Leu Asp Ala Asp Tyr 
785 790 795 800 
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Trp Tyr Gin Asn Leu Arg Gin Pro Val Arg Phe Gin Gin Thr Val Ala 
805 810 815 

Arg Met Ala Asp Gin Gly Tyr Arg Phe Phe Val Glu Val Ser Pro His 
820 825 830 

Pro Leu Leu Thr Ala Gly He Gin Glu Thr Leu Glu Ala Ala Asp Ala 
835 840 845 

Gly Gly Val Val Val Gly Ser Leu Arg Arg Gly Glu Gly Gly Ser Arg 
850 855 860 

Arg Trp Leu Thr Ser Leu Ala Glu Cys Gin Val Arg Gly Leu Pro Val 
865 870 875 880 

Asn Trp Glu Gin Val Phe Leu Asn Thr Gly Ala Arg Arg Val Pro Leu 
885 890 895 

Pro Thr Tyr Pro Phe Gin Arg Gin Arg Tyr Trp Leu Glu Ser Ala Glu 
900 905 910 

Tyr Asp Ala Gly Asp Leu Gly Ser Val Gly Leu Leu Ser Ala Glu His 
915 920 925 

Pro Leu Leu Gly Ala Ala Val Thr Leu Ala Asp Ala Gly Gly Phe Leu 
930 935 940 

Leu Thr Gly Lys Leu Ser Val Lys Thr Gin Pro Trp Leu Ala Asp His 
945 950 955 960 

Val Val Gly Gly Ala He Leu Leu Pro Gly Thr Ala Phe Val Glu Met 
965 970 975 

Leu He Arg Ala Ala Asp Gin Val Gly Cys Asp Leu He Glu Glu Leu 
980 985 990 

Ser Leu Thr Thr Pro Leu Val Leu Pro Ala Thr Gly Ala Val Gin Val 
995 1000 1005 

Gin He Ala Val Gly Gly Pro Asp Glu Ala Gly Arg Arg Ser Val Arg 
1010 1015 1020 

Val His Ser Cys Arg Asp Asp Ala Val Pro Gin Asp Ser Trp Thr Cys 
025 1030 1035 1040 

His Ala Thr Gly Thr Leu Thr Ser Ser Asp His Gin Asp Ala Gly Gin 
1045 1050 1055 
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Gly Pro Asp Gly lie Trp Pro Pro Asn Asp Ala Val Ala Val Pro Leu 
1060 1065 1070 

Asp Ser Phe Tyr Ala Arg Ala" Ala Glu Arg Gly Phe Asp Phe Gly Pro 
1075 1080 1085 

Ala Phe Gin Gly Leu Gin Ala Ala Trp Lys Arg Gly Asp Glu He Phe 
1090 1095 1100 

Ala Glu Val Gly Leu Pro Thr Ala His Arg Glu Asp Ala Gly Arg Phe 
105 1110 1115 1120 

Gly He His Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Leu Gly Ala 
1125 1130 1135 

Ala Glu Glu Asp Pro Asp Glu Gly Trp Leu Pro Phe Ala Trp Gin Gly 
1140 H45 1150 

Val Ser Leu Lys Ala Thr Gly Ala Leu Ser Leu Arg Val His Leu Val 
1155 1160 1165 

Pro Ala Gly Ala Asn Ala Val Ser Val Phe Thr Thr Asp Thr Thr Gly 
1170 1175 1180 

Gin Ala Val Leu Ser He Asp Ser Leu Val Leu Arg Gin He Ser Asp 
185 1190 1195 1200 

Lys Gin Leu Ala Ala Ala Arg Ala Met Glu His Glu Ser Leu Phe Arg 
1205 1210 1215 

Val Asp Trp Lys Arg He Ser Pro Gly Ala Ala Lys Pro Val Ser Trp 
1220 1225 1230 

Ala Val He Gly Asn Asp Glu Leu Ala Arg Ala Cys Gly Ser Ala Leu 
1235 1240 1245 

Gly Thr Glu Leu His Pro Asp Leu Thr Gly Leu Ala Asp Pro Pro Pro 
1250 1255 1260 

Asp Val Val Val Val Pro Cys Gly Ala Ser Arg Gin Asp Leu Asp Val 
265 1270 1275 1280 

Ala Ser Glu Ala Arg Ala Ala Thr Gin Arg Met Leu Asp Leu He Gin 
1285 1290 1295 

Asp Trp Leu Ala Ala Ala Arg Phe Ala Gly Ser Arg Leu Val Val Val 
1300 1305 1310 
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Thr Cys Gly Ala Ala Ser Thr Gly Pro Ala Glu Gly Val Ser Asp Leu 
1315 1320 1325 

Val His Ala Ala Ser Trp Gly Leu Leu Arg Ser Ala Gin Ser Glu Asn 
1330 1335 1340 

Pro Asp Arg Phe Val Leu Val Asp Val Asp Gly Thr Ala Glu Ser Trp 
345 1350 1355 1360 

Arg Ala Leu Ala Ala Ala Val Arg Ser Gly Glu Pro Gin Leu Ala Leu 
1365 1370 1375 

Arg Ala Gly Glu Val Arg Val Pro Arg Leu Ala Arg Cys Val Ala Ala 
1380 1385 1390 

Glu Asp Ser Arg He Pro Val Pro Gly Ala Asp Gly Thr Val Leu He 
1395 1400 1405 

Ser Gly Gly Thr Gly Leu Leu Gly Gly Leu Val Ala Arg His Leu Val 
1410 1415 1420 

Ala Glu Arg Gly Val Arg Arg Leu Val Leu Ala Gly Arg Arg Gly Trp 
425 1430 1435 1440 

Ser Ala Pro Gly Val Thr Asp Leu Val Asp Glu Leu Val Gly Leu Gly 
1445 1450 1455 

Ala Ala Val Glu Val Ala Ser Cys Asp Val Gly Asp Arg Ala Gin Leu 
1460 1465 1470 

Asp Arg Leu Leu Thr Thr He Ser Ala Glu Phe Pro Leu Arg Gly Val 
1475 1480 1485 

Val His Ala Ala Gly Ala Leu Ala Asp Gly Val Val Glu Ser Leu Thr 
1490 1495 1500 

Pro Glu His Val Ala Lys Val Phe Gly Pro Lys Ala Ala Gly Ala Trp 
505 1510 1515 1520 

His Leu His Glu Leu Thr Leu Asp Leu Asp Leu Ser Phe Phe Val Leu 
1525 1530 1535 

Phe Ser Ser Phe Ser Gly Val Ala Gly Ala Ala Gly Gin Gly Asn Tyr 
1540 1545 1550 

Ala Ala Ala Asn Ala Phe Leu Asp Gly Leu Ala Gin His Arg Arg Thr 
1555 1560 1565 
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Ala Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Glu Gin Pro 
1570 1575 1580 

Ser Gly Met Thr Gly Ala Leu Asp Ala Ala Gly Arg Ser Arg He Ala 
585 1590 1595 1600 

Arg Thr Asn Pro Pro Met Ser Ala Pro Asp Gly Leu Arg Leu Phe Glu 
1605 1610 1615 

Met Ala Phe Arg Val Pro Gly Glu Ser Leu Leu Val Pro Val His Val 
1620 1625 1630 

Asp Leu Asn Ala Leu Arg Ala Asp Ala Ala Asp Gly Gly Val Pro Ala 
1635 1640 1645 

Leu Leu Arg Asp Leu Val Pro Ala Pro Val Arg Arg Ser Ala Val Asn 
1650 1655 1660 

Glu Ser Ala Asp Val Asn Gly Leu Val Gly Arg Leu Arg Arg Leu Pro 
665 1670 1675 1680 

Asp Leu Asp Gin Glu Thr Gin Leu Leu Gly Leu Val Arg Glu His Val 
1685 1690 1695 

Ser Ala Val Leu Gly His Ser Gly Ala Val Glu Val Gly Ala Asp Arg 
1700 1705 1710 

Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly Val Glu Phe Arg 
1715 1720 1725 

Asn Arg Leu Gly Gly Val Leu Gly Val Arg Leu Pro Ala Thr Ala Val 
1730 1735 1740 

Phe Asp Tyr Pro Thr Pro Arg Ala Leu Val Arg Phe Leu Leu Asp Lys 
74 5 1750 1755 1760 

Leu He Gly Gly Val Glu Ala Pro Thr Pro Ala Pro Ala Ala Val Ala 
1765 1770 1775 

Ala Val Thr Ala Asp Asp Pro Val Val He Val Gly Met Gly Cys Arg 
1780 1785 1790 

Tyr Pro Gly Gly Val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala 
1795 1800 1805 

Gly Gly Leu Asp Ala Val Ala Glu Phe Pro Asp Asp Arg Gly Trp Asp 
1810 1815 1820 
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Gin Ala Gly Leu Phe Asp Pro Asp Pro Asp Arg Leu Gly Thr Ser Tyr 
825 1830 1835 1840 

Val Cys Glu Gly Gly Phe Leu Arg Asp Ala Ala Glu Phe Asp Ala Gly 
1845 1850 1855 

Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin 
1860 1865 1870 

Arg Leu Leu Leu Glu Val Ala Trp Glu Thr Val Glu Arg Ala Gly lie 
1875 1880 1885 

Asp Pro Leu Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu 
1890 1895 1900 

Met His His Asp Tyr Gly Ala Arg Phe He Thr Arg Ala Pro Glu Gly 
905 1910 1915 1920 

Phe Glu Gly Tyr Leu Gly Asn Gly Ser Ala Gly Gly Val Phe Ser Gly 
1925 1930 1935 

Arg Val Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp 
1940 1945 1950 

Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Gly Gin Ala 
1955 1960 1965 

Leu Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val 
1970 1975 1980 

Met Ala Thr Pro Gly Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu 
985 1990 1995 2000 

Ala Ala Asp Gly Arg Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr 
2005 2010 2015 

Gly Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp 
2020 2025 2030 

Ala Arg Arg Asn Gly His Ala Val Leu Ala Val Val Arg Gly Ser Ala 
2035 2040 2045 

Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro 
2050 2055 2060 

Ser Gin Gin Arg Val He Thr Gin Ala Leu Ala Ser Ala Gly Leu Ser 
065 2070 2075 2080 
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Val Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu 
2085 2090 2095 

Gly Asp Pro He Glu Ala Gin Ala Leu He Ala Thr Tyr Gly Gin Gly 
2100 2105 2110 

Arg Asp Ser Asp Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn He 
2115 2120 2125 

Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val 
2130 2135 2140 

Met Ala Met Arg His Gly Gin Leu Pro Ala Thr Leu His Val Asp Glu 
145 2150 2155 2160 

Pro Thr Ser Glu Val Asp Trp Ser Ala Gly Asp Val Gin Leu Leu Thr 
2165 2170 2175 

Glu Asn Thr Pro Trp Pro Gly Asn Ser His Pro Arg Arg Val Gly Val 
2180 2185 2190 

Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Val He Leu Glu Gin 
2195 2200 2205 

Ala Ser Lys Thr Pro Asp Glu Thr Ala Asp Lys Ser Gly Pro Asp Ser 
2210 2215 2220 

Glu Ser Thr Val Asp Leu Pro Ala Val Pro Leu He Val Ser Gly Arg 
225 2230 2235 2240 

Thr Pro Ala Ala Leu Ser Ala Gin Ala Ser Ala Leu Leu Ser Tyr Leu 
2245 2250 2255 

Gly Glu Arg Gly Asp He Ser Thr Leu Asp Ala Ala Phe Ser Leu Ala 
2260 2265 2270 

Ser Ser Arg Ala Ala Leu Glu Glu Arg Ala Val Val Leu Gly Ala Asp 
2275 2280 2285 

Arg Glu Thr Leu Leu Ser Gly Leu Glu Ala Leu Ala Ser Gly Arg Glu 
2290 2295 2300 

Ala Ser Gly Val Val Ser Gly Ser Pro Val Ser Gly Gly Val Gly Phe 
305 2310 2315 2320 

Val Phe Ala Gly Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Gly Leu 
2325 2330 2335 
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Tyr Ser Val Phe Pro Val Phe Ala Asp Ala Phe Asp Glu Ala Cys Ala 
2340 2345 2350 

Gly Leu Asp Ala His Leu Gly Gin Asp Val Gly Val Arg Asp Val Val 
2355 2360 2365 

Phe Gly Ser Asp Gly Ser Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser 
2370 2375 2380 

Gly Leu Phe Ala Leu Gin Val Gly Leu Leu Ser Leu Leu Gly Ser Trp 
385 2390 2395 2400 

Gly Val Arg Pro Gly Val Val Leu Gly His Ser Val Gly Glu Phe Ala 
2405 2410 2415 

Ala Ala Val Ala Ala Gly Val Leu Ser Leu Pro Asp Ala Ala. Arg Met 
2420 2425 2430 

Val Ala Gly Arg Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala 
2435 2440 2445 

Met Leu Ala Val Ala Ala Gly Glu Glu Gin Leu Arg Pro Leu Leu Ala 
2450 2455 2460 

Asp Arg Val Asp Gly Ala Gly He Ala Ala Val Asn Ala Pro Glu Ser 
465 2470 2475 2480 

Val Val Leu Ser Gly Asp Arg Glu Val Leu Asp Asp He Ala Gly Ala 
2485 2490 2495 

Leu Asp Gly Gin Gly He Arg Trp Arg Arg Leu Arg Val Ser His Ala 
2500 2505 2510 

Phe His Ser Tyr Arg Met Asp Pro Met Leu Gin Glu Phe Ala Glu He 
2515 2520 2525 

Ala Arg Ser Val Asp Tyr Arg Arg Gly Asp Leu Pro Val Val Ser Thr 
2530 2535 2540 

Leu Thr Gly Glu Leu Asp Thr Ala Gly Val Met Ala Thr Pro Glu Tyr 
545 2550 2555 2560 

Trp Val Arg Gin Val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Arg 
2565 2570 2575 

Val Leu Ala Gin Gin Gly Val Ala Thr He Phe Glu Leu Gly Pro Asp 
2580 2585 2590 
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Ala Thr Leu Ser Ala Leu lie Pro Asp Cys His Ser Trp Ala Asp Gin 
2595 2600 2605 

Ala Met Pro He Pro Met Leu Arg Lys Asp Arg Thr Glu Thr Glu Thr 
2610 2615 2620 

Val Val Ala Ala Val Ala Arg Ala His Thr Arg Gly Val Pro Val Glu 
625 2630 2635 2640 

Trp Ser Ala Tyr Phe Ala Gly Thr Gly Ala Arg Arg Val Glu Leu Pro 
2645 2650 2655 

Thr Tyr Ala Phe Gin Arg Gin Arg Tyr Trp Leu Glu Thr Ser Asp Tyr 
2660 2665 2670 

Gly Asp Val Thr Gly He Gly Leu Ala Ala Ala Glu His Pro Leu Leu 
2675 2680 2685 

Gly Ala Val Val Ala Leu Ala Asp Gly Asp Gly Met Val Leu Thr Gly 
2690 2695 2700 

Arg Leu Ser Val Gly Thr His Pro Trp Leu Ala Gin His Arg Val Leu 
705 2710 2715 2720 

Gly Glu Val Val Val Pro Gly Thr Ala He Leu Glu Met Ala Leu His 
2725 2730 2735 

Ala Gly Ala Arg Leu Gly Cys Asp Arg Val Glu Glu Leu Thr Leu Glu 
2740 2745 2750 

Thr Pro Leu Val Val Pro Glu Arg Ala Ala Gly Ala Gly Ser Arg Gly 
2755 2760 2765 

Pro Ala Gly Gly Thr Thr Val Ser He Glu Thr Ala Glu Glu Arg Val 
2770 2775 2780 

Arg Thr Asn Asp Ala He Glu He Gin Leu Leu Val Asn Ala Pro Asp 
785 2790 2795 2800 

Glu Gly Gly Arg Arg Arg Val Ser Leu Tyr Ser Arg Pro Ala Gly Gly 
2805 2810 2815 

Ser Arg Gly Gly Gly Trp Thr Arg His Ala Thr Gly Glu Leu Val Val 
2820 2825 2830 

Gly Thr Thr Gly Gly Arg Ala Val Pro Asp Trp Ser Ala Glu Gly Ala 
2835 2840 2845 
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Glu Ser He Ala Leu Asp Glu Phe Tyr Val Ala Leu Ala Gly Asn Gly 
2850 2855 2860 

Phe Glu Tyr Gly Pro Leu Phe Gin Gly Leu Gin Ala Ala Trp Arg Arg 
865 2870 2875 2880 

Gly Asp Glu Val Leu Ala Glu He Ala Pro Pro Ala Glu Ala Asp Ala 
2885 2890 2895 

Met Ala Ser Gly Tyr Leu Leu Asp Pro Ala Leu Leu Asp Ala Ala Leu 
2900 2905 2910 

Gin Ala Ser Ala Leu Gly Asp Arg Pro Glu Gin Gly Gly Ala Trp Leu 
2915 2920 2925 

Pro Phe Ser Phe Thr Gly Val Glu Leu Ser Ala Pro Ala Gly Thr He 
2930 2935 2940 

Ser Arg Val Arg Leu Glu Thr Arg Arg Pro Asp Ala He Ser Val Ala 
945 2950 2955 2960 

Val Met Asp Glu Ser Gly Arg Leu Leu Ala Ser He Asp Ser Leu Arg 
2965 2970 2975 

Leu Arg Ser Val Ser Ser Gly Gin Leu Ala Asn Arg Asp Ala Val Arg 
2980 2985 2990 

Asp Ala Leu Phe Glu Val Thr Trp Glu Pro Val Ala Thr Gin Ser Thr 
2995 3000 3005 

Glu Pro Gly Arg Trp Ala Leu Leu Gly Asp Thr Ala Cys Gly Lys Asp 
3010 3015 3020 

Asp Leu lie Lys Leu Ala Thr Asp Ser Ala Asp Arg Cys Ala Asp Leu 
025 3030 3035 3040 

Ala Ala Leu Ala Glu Lys Leu Asp Ser Ser Ala Leu Val Pro Asp Val 
3045 3050 3055 

Val Val Tyr Cys Ala Gly Glu Gin Ala Asp Pro Gly Thr Gly Ala Ala 
3060 3065 3070 

Ala Leu Ala Glu Thr Gin Gin Thr Leu Ala Leu Leu Gin Ala Trp Leu 
3075 3080 3085 

Ala Glu Pro Arg Leu Ala Glu Ala Arg Leu Val Val Val Thr Cys Ala 
3090 3095 3100 
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Ala Val Thr Thr Ala Pro Ser Asp Gly Ala Ser Glu Leu Ala His Ala 
105 3110 3115 3120 

Pro Leu Trp Gly Leu Leu Arg Ala Ala Gin Val Glu Asn Pro Gly Gin 
3125 3130 3135 

Phe Val Leu Ala Asp Val Asp Gly Thr Ala Glu Ser Trp Arg Ala Leu 
3140 3145 3150 

Pro Ser Ala Leu Gly Ser Met Glu Pro Gin Leu Ala Leu Arg Lys Gly 
3155 3160 3165 

Ala Val Arg Ala Pro Arg Leu Ala Ser Val Ala Gly Gin He Asp Val 
3170 3175 3180 

Pro Ala Val Val Ala Asp Pro Asp Arg Thr Val Leu He Ser Gly Gly 
185 3190 3195 3200 



Thr Gly Leu Leu Gly Gly Ala Val Ala Arg His Leu Val Thr Glu Arg 
3205 3210 3215 

Gly Val Arg Arg Leu Val Leu Thr Gly Arg Arg Gly Trp Asp Ala Pro 
3220 3225 3230 

Gly He Thr Glu Leu Val Gly Glu Leu Asn Gly Leu Gly Ala Val Val 
3235 3240 3245 

Asp Val Val Ala Cys Asp Val Ala Asp Arg Ala Asp Leu Glu Ser Leu 
3250 3255 3260 

Leu Ala Ala Val Pro Ala Glu Phe Pro Leu Cys Gly Val Val His Ala 
265 3270 3275 3280 

Ala Gly Ala Leu Ala Asp Gly Val He Glu Ser Leu Ser Pro Asp Asp 
3285 3290 3295 

Val Gly Ala Val Phe Gly Pro Lys Ala Ala Gly Ala Trp Asn Leu His 
3300 3305 3310 

Glu Leu Thr Arg Asp Thr Asp Leu Ser Phe Phe Ala Leu Phe Ser Ser 
3315 3320 3325 

Leu Ser Gly Val Ala Gly Ala Pro Gly Gin Gly Asn Tyr Ala Ala Ala 
3330 3335 3340 

Asn Ala Phe Leu Asp Ala Leu Ala His Tyr Arg Arg Ser Gin Gly Leu 
345 3350 3355 3360 
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Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Glu Gin Pro Ser Gly Met 
3365 3370 3375 

Thr Glu Thr Leu Ser Glu Val Asp Arg Ser Arg lie Ala Arg Ala Asn 
3380 3385 3390 

Pro Pro Leu Ser Thr Lys Glu Gly Leu Arg Leu Phe Asp Ala Gly Leu 
3395 3400 .3405 

Ala Leu Asp Arg Ala Ala Val Val Pro Ala Lys Leu Asp Arg Thr Phe 
3410 3415 3420 

Leu Ala Glu Gin Ala Arg Ser Gly Ser Leu Pro Ala Leu Leu Thr Ala 
425 3430 3435 3440 

Leu Val Pro Pro lie Arg Arg Asn Arg Arg Ala Ser Gly Thr Glu Leu 
3445 3450 3455 

Ala Asp Glu Gly Thr Leu Leu Gly Val Val Arg Glu His Ala Ala Ala 
3460 3465 3470 

Val Leu Gly Tyr Ser Ser Ala Ala Asp Val Gly Val Glu Arg Ala Phe 
3475 3480 3485 

Arg Asp Leu Gly Phe Asp Ser Leu Ser Gly Val Glu Leu Arg Asn Arg 
3490 3495 3500 

Leu Ala Gly Val Leu Gly Val Arg Leu Pro Ala Thr Ala Val Phe Asp 
505 3510 3515 3520 

Tyr Pro Thr Pro Arg Ala Leu Ala Arg Phe Leu His Gin Glu Leu Ala 
3525 3530 3535 

Asp Glu He Ala Thr Thr Pro Ala Pro Val Thr Thr Thr Arg Ala Pro 
3540 3545 3550 

Val Ala Glu Asp Asp Leu Val Ala He Val Gly Met Gly Cys Arg Phe 
3555 3560 3565 

Pro Gly Gin Val Ser Ser Pro Glu Glu Leu Trp Arg Leu Val Ala Gly 
3570 3575 3580 

Gly Val Asp Ala Val Ala Asp Phe Pro Ala Asp Arg Gly Trp Asp Leu 
585 3590 3595 3600 

Ala Gly Leu Phe Asp Pro Asp Pro Glu Arg Ala Gly Lys Thr Tyr Val 
3605 3610 3615 
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Arg Glu Gly Ala Phe Leu Thr Asp Ala Asp Arg Phe Asp Ala Gly Phe 
3620 3625 3630 

Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
3635 3640 3645 

Leu Leu Leu Glu Leu Ser Trp Glu Ala He Glu Arg Ala Gly He Asp 
3650 3655 3660 

Pro Gly Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met 
665 3670 3675 3680 

Tyr His Asp Tyr Gly Ala Arg Phe Ala Ser Arg Ala Pro Glu Gly Phe 
3685 3690 3695 

Glu Gly Tyr Leu Gly Asn Gly Ser Ala Gly Ser Val Ala Ser Gly Arg 
3700 3705 3710 

He Ala Tyr Ser Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr 
3715 3720 3725 

Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Gly Gin Ser Leu 
3730 3735 3740 

Arg Ser Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met 
745 3750 3755 3760 

Ser Thr Pro Gly Thr Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala 
3765 3770 3775 

Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ser Ala Asp Gly Thr Gly 
3780 3785 3790 

Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala 
3795 3800 3805 

Arg Arg Asn Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val 
3810 3815 3820 

Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser 
825 3830 3835 3840 

Gin Gin Arg Val He Gin Gin Ala Leu Ala Ser Ala Gly Leu Ser Val 
3845 3850 3855 

Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
3860 3865 3870 
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Asp Pro He Glu Ala Gin Ala Leu He Ala Thr Tyr Gly Arg Asp Arg 
3875 3880 3885 

Asp Pro Gly Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn He Gly 
3890 3895 3900 

His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Met 
905 3910 3915 3920 

Ala Met Arg His Gly Gin Leu Pro Arg Thr Leu His Val Asp Ala Pro 
3925 3930 3935 

Ser Ser Gin Val Asp Trp Ser Ala Gly Arg Val Gin Leu Leu Thr Glu 
3940 3945 3950 

Asn Thr Pro Trp Pro Asp Ser Gly Arg Pro Cys Arg Val Gly Val Ser 
3955 3960 3965 

Ser Phe Gly He Ser Gly Thr Asn Ala His Val He Leu Glu Gin Ser 
3970 3975 3980 

Thr Gly Gin Met Asp Gin Ala Ala Glu Pro Asp Ser Ser Pro Val Leu 
985 3990 3995 4000 

Asp Val Pro Val Val Pro Trp Val Val Ser Gly Lys Thr Pro Glu Ala 
4005 4010 4015 

Leu Ser Ala Gin Ala Ala Thr Leu Ala Thr Tyr Leu Asp Gin Asn Val 
4020 4025 4030 

Asp Val Ser Pro Leu Asp Val Gly He Ser Leu Ala Val Thr Arg Ser 
4035 4040 4045 



Ala Leu Asp Glu Arg Ala Val Val Leu Gly Ser Asp Arg Asp Thr Leu 
4050 4055 4060 

Leu Ser Gly Leu Asn Ala Leu Ala Ala Gly His Glu Ala Ala Gly Val 
065 4070 4075 4080 

Val Thr Gly Pro Val Gly He Gly Gly Arg Thr Gly Phe Val Phe Ala 
4085 4090 4095 

Gly Gin Gly Gly Gin Trp Leu Gly Met Gly Arg Arg Leu Tyr Ser Glu 
4100 4105 4110 

Phe Pro Ala Phe Ala Gly Ala Phe Asp Glu Ala Cys Ala Glu Leu Asp 
4115 4120 4125 
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Ala Asn Leu Gly Arg Glu Val Gly Val Arg Asp Val Val Phe Gly Ser 
4130 4135 4140 

Asp Glu Ser Leu Leu Asp Arg Thr Leu Trp Ala Gin Ser Gly Leu Phe 
145 4150 4155 4160 

Ala Leu Gin Val Gly Leu Trp Glu Leu Leu Gly Thr Trp Gly Val Arg 
4165 4170 4175 

Pro Ser Val Val Leu Gly His Ser Val Gly Glu Leu Ala Ala Ala Phe 
4180 4185 4190 

Ala Ala Gly Val Leu Ser Met Ala Glu Ala Ala Arg Leu Val Ala Gly 
4195 4200 4205 

Arg Ala Arg Leu Met Gin Ala Leu Pro Ser Gly Gly Ala Met Leu Ala 
4210 4215 4220 

Val Ser Ala Thr Glu Ala Arg Val Gly Pro Leu Leu Asp Gly Val Arg 
225 4230 4235 4240 

Asp Arg Val Gly Val Ala Ala Val Asn Ala Pro Gly Ser Val Val Leu 
4245 4250 4255 

Ser Gly Asp Arg Asp Val Leu Asp Gly He Ala Gly Arg Leu Asp Gly 
4260 4265 4270 

Gin Gly He Arg Ser Arg Trp Leu Arg Val Ser His Ala Phe His Ser 
4275 4280 4285 

His Arg Met Asp Pro Met Leu Ala Glu Phe Ala Glu Leu Ala Arg Ser 
4290 4295 4300 

Val Asp Tyr Arg Ser Pro Arg Leu Pro He Val Ser Thr Leu Thr Gly 
305 4310 4315 4320 

Asn Leu Asp Asp Val Gly Val Met Ala Thr Pro Glu Tyr Trp Val Arg 
4325 4330 433*5 

Gin Val Arg Glu Pro Val Arg Phe Ala Asp Gly Val Gin Ala Leu Val 
4340 4345 4350 

Asp Gin Gly Val Asp Thr He Val Glu Leu Gly Pro Asp Gly Ala Leu 
4355 4360 4365 

Ser Ser Leu Val Gin Glu Cys Val Ala Glu Ser Gly Arg Ala Thr Gly 
4370 4375 4380 
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lie Pro Leu.Val Arg Arg Asp Arg Asp Glu Val Arg Thr Val Leu Asp 
385 4390 4395 4400 

Ala Leu Ala Gin Thr His Thr Arg Gly Gly Ala Val Asp Trp Gly Ser 
4405 4410 4415 

Phe Phe Ala Gly Thr Arg Ala Thr Gin Val Asp Leu Pro Thr Tyr Ala 
4420 4425 4430 

Phe Gin Arg Gin Arg Tyr Trp Leu Glu Pro Ser Asp Ser Gly Asp Val 
4435 4440 4445 

Thr Gly Val Gly Leu Thr Gly Ala Glu His Pro Leu Leu Gly Ala Val 
4450 4455 4460 

Val Pro Val Ala Gly Gly Asp Glu Val Leu Leu Thr Gly Arg Leu Ser 
465 4470 4475 4480 

Val Gly Thr His Pro Trp Leu Ala Glu His Arg Val Leu Gly Glu Val 
4485 4490 4495 

Val Val Pro Gly Thr Ala Leu Leu Glu Met Ala Trp Arg Ala Gly Ser 
4500 4505 4510 

Gin Val Gly Cys Glu Arg Val Glu Glu Leu Thr Leu Glu Ala Pro Leu 
4515 4520 4525 

Val Leu Pro Glu Arg Gly Ala Ala Ala Val Gin Leu Ala Val Gly Ala 
4530 4535 4540 

Pro Asp Glu Ala Gly Arg Arg Ser Leu Gin Leu Tyr Ser Arg Gly Ala 
545 4550 4555 4560 

Asp Glu Asp Gly Asp Trp Arg Arg lie Ala Ser Gly Leu Leu Ala Gin 
4565 4570 4575 

Ala Asn Ala Val Pro Pro Ala Asp Ser Thr Ala Trp Pro Pro Asp Gly 
4580 4585 4590 

Ala Gly Gin Val Asp Leu Ala Glu Phe Tyr Glu Arg Leu Ala Glu Arg 
4595 4600 4605 

Gly Leu Thr Tyr Gly Pro Val Phe Gin Gly Leu Arg Ala Ala Trp Arg 
4610 4615 4620 

His Gly Asp Asp lie Phe Ala Glu Leu Ala Gly Ser Pro Asp Ala Ser 
625 4630 4635 4640 
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Gly Phe Gly lie His Pro Ala Leu Leu Asp Ala Ala Leu His Ala Met 
4645 4650 4655 

Ala Leu Gly Ala Ser Pro Asp Ser Glu Ala Arg Leu Pro Phe Ser Trp 
4660 4665 4670 

Arg Gly Ala Gin Leu Tyr Arg Ala Glu Gly Ala Ala Leu Arg Val Arg 
4675 4680 4685 

Leu Ser Pro Leu Gly Ser Gly Ala Val Ser Leu Thr Leu Val Asp Ala 
4690 4695 4700 

Thr Gly Arg Arg Val Ala Ala Val Glu Ser Leu Ser Thr Arg Pro Val 
705 4710 4715 4720 

Ser Thr Asp Gin lie Gly Ala Gly Arg Gly Asp Gin Glu Arg Leu Leu 
4725 4730 4735 

His Val Glu Trp Val Arg Ser Ala Glu Ser Ala Gly Met Ser Leu Thr 
4740 4745 4750 

Ser Cys Ala Val Val Gly Leu Gly Glu Pro Glu Trp His Ala Ala Leu 
4755 4760 4765 

Lys Thr Thr Gly Val Gin Val Glu Ser His Ala Asp Leu Ala Ser Leu 
4770 4775 4780 

Ala Thr Glu Val Ala Lys Arg Gly Ser Ala Pro Gly Ala Val He Val 
785 4790 4795 4800 

Pro Cys Pro Arg Pro Arg Ala Met Gin Glu Leu Pro Thr Ala Ala Arg 
4805 4810 4815 

Arg Ala Thr Gin Gin Ala Met Ala Met Leu Gin Gin Trp Leu Ala Asp 
4820 4825 4830 

Asp Arg Phe Val Ser Thr Arg Leu He Leu Leu Thr His Arg Ala Val 
4835 4840 4845 

Ser Ala Val Ala Gly Glu Asp Val Leu Asp Leu Val His Ala Pro Leu 
4850 4855 4860 

Trp Gly Leu Val Arg Ser Ala Gin Ala Glu His Pro Asp Arg Phe Ala 
865 4870 4875 4880 

Leu He Asp Met Asp Asp Glu Arg Ala Ser Gin Thr Ala Leu Ala Glu 
4885 4890 4895 
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Ala Leu Thr Ala Gly Glu Ala Gin Leu Ala Val Arg Ser Gly Val Val 
4900 4905 4910 

Leu Ala Pro Arg Leu Gly Gin Val Lys Val Ser Gly Gly Glu Ala Phe 
4915 4920 4925 

Arg Trp Asp Glu Gly Thr Val Leu Val Thr Gly Gly Thr Gly Gly Leu 
4930 4935 4940 

Gly Ala Leu Leu Ala Arg His Leu Val Ser Ala His Gly Val Arg His 
945 4950 4955 4960 

Leu Leu Leu Ala Ser Arg Arg Gly Leu Ala Ala Pro Gly Ala Asp Glu 
4965 4970 4975 

Leu Val Ala Glu Leu Glu Gin Ala Gly Ala Asp Val Ala Val Val Ala 
4980 4985 4990 

Cys Asp Ser Ala Asp Arg Asp Ser Leu Ala Arg Leu Val Ala Ser Val 
4995 5000 5005 

Pro Ala Glu Asn Pro Leu Arg Val Val Val His Ala Ala Gly Val Leu 
5010 5015 5020 

Asp Asp Gly Val Leu Met Ser Met Ser Pro Glu Arg Leu Asp Ala Val 
025 5030 5035 5040 

Leu Arg Pro Lys Val Asp Ala Ala Trp Tyr Leu His Glu Leu Thr Arg 
5045 5050 5055 

Glu Leu Gly Leu Ser Ala Phe Val Leu Phe Ser Ser Val Ala Gly Leu 
5060 5065 5070 

Phe Gly Gly Ala Gly Gin Ser Asn Tyr Ala Ala Gly Asn Ala Phe Leu 
5075 5080 5085 

Asp Ala Leu Ala His Cys Arg Gin Ala Gin Gly Leu Pro Ala Leu Ser 
5090 5095 5100 

Leu Ala Ser Gly Leu Trp Ala Ser lie Asp Gly Met Ala Gly Asp Leu 
105 5110 5115 5120 

Ala Ala Ala Asp Val Glu Arg Leu Ser Arg Ala Gly lie Gly Pro Leu 
5125 5130 5135 

Ser Ala Pro Gly Gly Leu Ala Leu Phe Asp Ala Ala Val Gly Ser Asp 
5140 5145 5150 
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Glu Pro Leu Leu Ala Pro Val Arg Leu Asp Val Glu Ala Leu Arg Val 
5155 5160 5165 

Gin Ala Arg Ser Val Gin Thr Arg He Pro Glu Met Leu His Gly Met 
5170 5175 5180 

Ala Met Gly Pro Ser Arg Arg Thr Pro Phe Thr Ser Arg Val Glu Pro 
185 5190 5195 5200 

Leu His Glu Arg Leu Ala Gly Leu Ser Glu Gly Glu Arg Arg Gin Gin 
5205 5210 5215 

Val Leu Gin Arg Val Arg Ala Asp He Ala Val Val Leu Gly His Gly 
5220 5225 5230 

Arg Ser Ser Asp Val Asp lie Glu Lys Pro Leu Ala Glu Leu Gly Phe 
5235 5240 5245 

Asp Ser Leu Thr Ala He Glu Leu Arg Asn Arg Leu Ala Thr Ala Thr 
5250 5255 5260 

Gly Leu Arg Leu Pro Ala Thr Leu Ala Phe Asp His Gly Thr Ala Ala 
265 5270 5275 5280 

Ala Leu Ala Gin His Val Cys Ala Gin Leu Gly Thr Ala Thr Ala Pro 
5285 5290 5295 

Ala Pro Arg Arg Thr Asp Asp Asn Asp Ala Thr Glu Pro Val Arg Ser 
5300 5305 5310 

Leu Phe Gin Gin Ala Tyr Ala Ala Gly Arg He Leu Asp Gly Met Asp 
5315 5320 5325 

Leu Val Lys Val Ala Ala Gin Leu Arg Pro Val Phe Gly Ser Pro Gly 
5330 5335 5340 

Glu Leu Glu Ser Leu Pro Lys Pro Val Gin Leu Ser Arg Gly Pro Glu 
345 5350 5355 ■ 5360 

Glu Leu Ala Leu Val Cys Met Pro Ala Leu He Gly Met Pro Pro Ala 
5365 5370 5375 

Gin Gin Tyr Ala Arg He Ala Ala Gly Phe Arg Asp Val Arg Asp Val 
5380 5385 5390 

Ser Val He Pro Met Pro Gly Phe He Ala Gly Glu Pro Leu Pro Ser 
5395 5400 5405 
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Ala He Glu Val Ala Val Arg Thr Gin Ala Glu Ala Val Leu Gin Glu 
5410 5415 5420 

Phe Ala Gly Gly Ser Phe Val Leu Val Gly His Ser Ser Gly Gly Trp 
425 5430 5435 5440 

Leu Ala His Glu Val Ala Gly Glu Leu Glu Arg Arg Gly Val Val Pro 
5445 5450 5455 

Ala Gly Val Val Leu Leu Asp Thr Tyr He Pro Gly Glu He Thr Pro 
5460 5465 5470 

Arg Phe Ser Val Ala Met Ala His Arg Thr Tyr Glu Lys Leu Ala Thr 
5475 5480 5485 

Phe Thr Asp Met Gin Asp Val Gly He Thr Ala Met Gly Gly Tyr Phe 
5490 5495 5500 

Arg Met Phe Thr Glu Trp Thr Pro Thr Pro He Gly Ala Pro Thr Leu 
505 5510 5515 5520 

Phe Val Arg Thr Glu Asp Cys Val Ala Asp Pro Glti Gly Arg Pro Trp 
5525 5530 5535 

Thr Asp Asp Ser Trp Arg Pro Gly Trp Thr Leu Ala Asp Ala Thr Val 
5540 5545 5550 

Gin Val Pro Gly Asp His Phe Ser Met Met Asp Glu His Ala Gly Ser 
5555 5560 5565 

Thr Ala Gin Ala Val Ala Ser Trp Leu Asp Lys Leu Asn Gin Arg Thr 
5570 5575 5580 

Ala Arg Gin Arg 
585 



<210> 51 
<211> 2363 
<2l2> DNA 

<213> Saccharopolyspora spinosa 
<400> 51 

cccgggccgg ccgtgggcgc cgggtccgcg gggcggaacg ggtctgccgg gctgaccctg 60 
cgggccgcct ggccacctcg gccggtcctc cacgaatcct ccttggcctg ccggcgggcg 120 
ttacagtttc gggaagtgat tttgctcgtg ttccgaatgc aggctagcgg tgttcctggg 180 
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gcggttgggc aggtcccagc aacagtggtg 
cacgttgcgt aatgcgggat cctgcttcgt 
cggacctgca gcgggccgcg aaatcccggc 
cggcggagcc ggtttcatcg gctcgcacta 
cgcattcgcc gacgccgacg tggtcgtgct 
gaacctggcg ccggtcgcgg acaacccccg 
ccgcgaactg gttggcggcc tgatgtccgg 
aacccacgtc gaccgctcga tcaccggctc 
caccaacgtg ctgctgcagg ccgcgctcga 
caccgacgag gtctacggct ccatcgagga 
gccgaattcc ccgtactcgg cggcgaaagc 
ccgcacccac ggactgccgg tgtgcatcac 
gttcccggag aaggtgctgc cgctgttcat 
gctctacggc gacgggctca acgtgcggga 
catccagctg gtggccgact ccgggcgcgc 
cgagctgacc aacaacgagc tgaccgagcg 
ggtggtgcgg ccggtcaccg accgcaaggg 
caagatcgtc gaggaactgg ggtacgcgcc 
gacaatccgc tggtaccagg acaaccggga 
ggtggctcga tgagtcgcct cgccgtgctg 
gagctggccc ggatcctccc cgcgcggacg 
gaactggacg tcaccgacgc cgaggaggtc 
gcgaaggacg cggagctgcg accggtggtg 
gcggccgagt ccgacccgga ccgcgcggcc 
gcgaaagcgt gccggagcag cggtctgccc 
cccggtgatg gggcccggcc gtacgagccg 
gggcgcacca agctcgaagg cgaacgggcc 
gtgcgcacgg catgggtgta cggcgcgagc 
ctctcggggg agcgcgacac gctgtccgtt 
gcggcggacc tggcgagcgg cctgctggag 
ccggagcaga aggtgctgca ctgcaccaat 
cgggcgatct tcgcggaatt cggcctggac 
gacttccccc tcccggcgca ccgcccggcc 
gaggcgggcc tgaccccgat gcgcacctgg 
gacggcgaaa ccctccgaac ccgctgacca 
gggccgcttc gacgcgatat cggctggcgc 
gaaggaaggc caggtgcccc ggg 



atatccctca taagggcgaa gcgacttcgt 240 
agctcggtgt gtcatgccag actgcgcacg 300 
gaggaagggc gcgatgcgga ttctggtcac 360 
cgttcggcag ttgctcggtg gtgcgtaccc 420 
cgacaagctc acctacgccg gcaacgaggc 4 80 
gctgaagttc gtctgcggcg acatctgcga 540 
cgtggacgtg gtggtgcact tcgccgccga 600 
ggacgccttc gtgatcacca acgtggtcgg 660 
cgccgagatc ggcaagttcg tgcacgtttc 720 
cggctcgtgg cccgaagacc acgcgctgga 780 
gggctcggac ctgctggccc gcgcctacca 840 
ccgctgctcc aacaactacg ggccctacca 900 
cacgaacctg atggacggca gccaggtgcc 960 
ctggctgcac gtcagcgacc actgccgggg 1020 
gggcgagatc tacaacatcg gcggcggcac 1080 
gctgctggca gagctgggcc tcgactggtc 114 0 
ccacgaccgc cgctactcgg tggaccacag 1200 
gcaggtcgac ttcgagaccg ggctgcgcga 1260 
ctggtgggag ccgctgaagg cccgatcggc 132 0 
gtgcccggcg gccgcggcca gctgggctcg 1380 
ggggcgctgg tgcaccggcc gggttccggg 144 0 
gccgacgcgt tgggttcctt cgcggagacg 1500 
atcaacgccg cggcgtacac ggcggtggac 1560 
cggatcaacg ccgaaggcgc ggcctcgctg 162 0 
ctggtgcacg tgtcgacgga ttacgtgttc 1680 
acggacccga ccgggccgcg atcggtctac 174 0 
gtgctggagt ccggcgcgcg ggcctgggtg 1800 
ggcaagaact tcctgaaaac gatgatccgc 1860 
gtggacgatc agatcggctc gccgacttgg 1920 
ctggccgaac gggtcgccga acgccgtgga 1980 
tccggccagg tgacctggta cgagttcgcg 2 04 0 
gagaaccgcg tccacccgtg cacgacggcg 2100 
tactcggtcc tgtccgacgt ggcgtggcga 2160 
cgggaagccc tggcggcggc cttcgagaaa 222 0 
gtcacccgga gggcgcgagt agccccggca 22 80 
ggtgcgcaca atgggtgtcg ccggggcgag 2340 

2363 



<210> 52 
<211> 990 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (987) 

<223> ORF23; dNTP -Glucose -4 , 6 -Dehydratase 
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<400> 52 

atg egg att ctg gtc acc ggc gga gec ggt ttc ate ggc teg cac tac 4 8 

Met Arg lie Leu Val Thr Gly Gly Ala Gly Phe He Gly Ser His Tyr 
15 10 15 

gtt egg cag ttg etc ggt ggt gcg tac ccc gca ttc gee gac gee gac 96 
Val Arg Gin Leu Leu Gly Gly Ala Tyr Pro Ala Phe Ala Asp Ala Asp 
20 25 30 

gtg gtc gtg etc gac aag etc acc tac gee ggc aac gag gcg aac ctg 144 
val Val Val Leu Asp Lys Leu Thr Tyr Ala Gly Asn Glu Ala Asn Leu 
35 40 45 

gcg ccg gtc gcg gac aac ccc egg ctg aag ttc gtc tgc ggc gac ate 192 
Ala Pro Val Ala Asp Asn Pro Arg Leu Lys Phe Val Cys Gly Asp He 
50 55 60 

tgc gac cgc gaa ctg gtt ggc ggc ctg atg tec ggc gtg gac gtg gtg 240 
Cys Asp Arg Glu Leu Val Gly Gly Leu Met Ser Gly Val Asp Val Val 
65 70 75 80 

gtg cac ttc gee gee gaa acc cac gtc gac cgc teg ate acc ggc teg 288 
Val His Phe Ala Ala Glu Thr His Val Asp Arg Ser He Thr Gly Ser 
85 90 95 

gac gee ttc gtg ate acc aac gtg gtc ggc acc aac gtg ctg ctg cag 336 
Asp Ala Phe Val He Thr Asn Val Val Gly Thr Asn Val Leu Leu Gin 
100 105 110 

gee gcg etc gac gee gag ate ggc aag ttc gtg cac gtt tec acc gac 3 84 
Ala Ala Leu Asp Ala Glu He Gly Lys Phe Val His Val Ser Thr Asp 
115 120 125 

gag gtc tac ggc tec ate gag gac ggc teg tgg ccc gaa gac cac gcg 432 
Glu Val Tyr Gly Ser He Glu Asp Gly Ser Trp Pro Glu Asp His Ala 
130 135 140 

ctg gag ccg aat tec ccg tac teg gcg gcg aaa gcg ggc teg gac ctg 480 
Leu Glu Pro Asn Ser Pro Tyr Ser Ala Ala Lys Ala Gly Ser Asp Leu 
145 150 155 160 

ctg gee cgc gee tac cac cgc acc cac gga ctg ccg gtg tgc ate acc 528 
Leu Ala Arg Ala Tyr His Arg Thr His Gly Leu Pro Val Cys He Thr 
165 170 175 

cgc tgc tec aac aac tac ggg ccc tac cag ttc ccg gag aag gtg ctg 576 
Arg Cys Ser Asn Asn Tyr Gly Pro Tyr Gin Phe Pro Glu Lys Val Leu 
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ccg ctg ttc ate acg aac ctg atg gac ggc age cag gtg ccg etc tac 624 
Pro Leu Phe He Thr Asn Leu Met Asp Gly Ser Gin Val Pro Leu Tyr 
195 200 205 

ggc gac ggg etc aac gtg egg gac tgg ctg cac gtc age gac cac tgc 672 
Gly Asp Gly Leu Asn Val Arg Asp Trp Leu His Val Ser Asp His Cys 
210 215 220 

egg ggc ate cag ctg gtg gee gac tec ggg cgc gcg ggc gag ate tac 720 
Arg Gly He Gin Leu Val Ala Asp Ser Gly Arg Ala Gly Glu He Tyr 
225 230 235 240 

aac ate ggc ggc ggc ace gag ctg ace aac aac gag ctg ace gag egg 768 
Asn He Gly Gly Gly Thr Glu Leu Thr Asn Asn Glu Leu Thr Glu Arg 
245 250 255 

ctg ctg gca gag ctg ggc etc gac tgg teg gtg gtg egg ccg gtc ace 816 
Leu Leu Ala Glu Leu Gly Leu Asp Trp Ser Val Val Arg Pro Val Thr 
260 265 270 

gac cgc aag ggc cac gac cgc cgc tac teg gtg gac cac age aag ate 864 
Asp Arg Lys Gly His Asp Arg Arg Tyr Ser Val Asp His Ser Lys He 
275 280 285 

gtc gag gaa ctg ggg tac gcg ccg cag gtc gac ttc gag ace ggg ctg 912 
Val Glu Glu Leu Gly Tyr Ala Pro Gin Val Asp Phe Glu Thr Gly Leu 
290 295 300 

cgc gag aca ate cgc tgg tac cag gac aac egg gac tgg tgg gag ccg 960 
Arg Glu Thr He Arg Trp Tyr Gin Asp Asn Arg Asp Trp Trp Glu Pro 
305 310 315 320 

ctg aag gee cga teg gcg gtg get cga tga 990 
Leu Lys Ala Arg Ser Ala Val Ala Arg 
325 



<210> 53 
<211> 329 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 53 

Met Arg lie Leu Val Thr Gly Gly Ala Gly Phe He Gly Ser His Tyr 
15 10 15 
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Val Arg Gin Leu Leu Gly Gly Ala Tyr Pro Ala Phe Ala Asp Ala Asp 
20 25 30 

Val Val Val Leu Asp Lys Leu Thr Tyr Ala Gly Asn Glu Ala Asn Leu 
35 40 45 

Ala Pro Val Ala Asp Asn Pro Arg Leu Lys Phe Val Cys Gly Asp lie 
50 55 60 

Cys Asp Arg Glu Leu Val Gly Gly Leu Met Ser Gly Val Asp Val Val 
65 70 75 80 

Val His Phe Ala Ala Glu Thr His Val Asp Arg Ser lie Thr Gly Ser 
85 90 95 

Asp Ala Phe Val He Thr Asn Val Val Gly Thr Asn Val Leu Leu Gin 
100 105 110 

Ala Ala Leu Asp Ala Glu He Gly Lys Phe Val His Val Ser Thr Asp 
115 120 125 

Glu Val Tyr Gly Ser He Glu Asp Gly Ser Trp Pro Glu Asp His Ala 
130 135 140 

Leu Glu Pro Asn Ser Pro Tyr Ser Ala Ala Lys Ala Gly Ser Asp Leu 
145 150 155 160 

Leu Ala Arg Ala Tyr His Arg Thr His Gly Leu Pro Val Cys He Thr 
165 170 175 

Arg Cys Ser Asn Asn Tyr Gly Pro Tyr Gin Phe Pro Glu Lys Val Leu 
180 185 190 

Pro Leu Phe He Thr Asn Leu Met Asp Gly Ser Gin Val Pro Leu Tyr 
195 200 205 

Gly Asp Gly Leu Asn Val Arg Asp Trp Leu His Val Ser Asp His Cys 
210 215 220 

Arg Gly He Gin Leu Val Ala Asp Ser Gly Arg Ala Gly Glu lie Tyr 
225 230 235 240 

Asn He Gly Gly Gly Thr Glu Leu Thr Asn Asn Glu Leu Thr Glu Arg 
245 250 255 

Leu Leu Ala Glu Leu Gly Leu Asp Trp Ser Val Val Arg Pro Val Thr 
260 265 270 
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Asp Arg Lys Gly His Asp Arg Arg Tyr Ser Val Asp His Ser Lys lie 
275 280 285 

Val Glu Glu Leu Gly Tyi Ala Pro Gin Val Asp Phe Glu Thr Gly Leu 
290 295 300 

Arg Glu Thr He Arg Trp Tyr Gin Asp Asn Arg Asp Trp Trp Glu Pro 
305 310 315 320 

Leu Lys Ala Arg Ser Ala Val Ala Arg 
325 



<210> 54 
<211> 918 
<212> DNA 

<213> Saccharopolyspora spinosa 

<220> 

<221> CDS 

<222> (1) . . (915) 

<223> ORF24; dNTP-4 -keto-6-Deoxyglucose-3 , 5 -Epimerase 
<400> 54 

atg agt cgc etc gec gtg ctg gtg ccc ggc ggc cgc ggc cag ctg ggc 48 
Met Ser Arg Leu Ala Val Leu Val Pro Gly Gly Arg Gly Gin Leu Gly 
15 10 15 

teg gag ctg gee egg ate etc ccc gcg egg acg ggg gcg ctg gtg cac 96 
Ser Glu Leu Ala Arg He Leu Pro Ala Arg Thr Gly Ala Leu Val His 
20 25 30 

egg ccg ggt tec ggg gaa ctg gac gtc ace gac gee gag gag gtc gee 144 
Arg Pro Gly Ser Gly Glu Leu Asp Val Thr Asp Ala Glu Glu Val Ala 
35 40 45 

gac gcg ttg ggt tec ttc gcg gag acg gcg aag gac gcg gag ctg cga 192 
Asp Ala Leu Gly Ser Phe Ala Glu Thr Ala Lys Asp Ala Glu Leu Arg 
50 55 60 

ccg gtg gtg ate aac gee gcg gcg tac acg gcg gtg gac gcg gee gag 240 
Pro Val Val He Asn Ala Ala Ala Tyr Thr Ala Val Asp Ala Ala Glu 
65 70 75 80 

tec gac ccg gac cgc gcg gee egg ate aac gee gaa ggc gcg gee teg 288 
Ser Asp Pro Asp Arg Ala Ala Arg He Asn Ala Glu Gly Ala Ala Ser 
85 90 95 
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ctg gcg aaa gcg tgc egg age age ggt ctg ccc ctg gtg cac gtg teg 336 
Leu Ala Lys Ala Cys Arg Ser Ser Gly Leu Pro Leu Val His Val Ser 
100 105 110 

acg gat tac gtg ttc ccc ggt gat ggg gee egg ccg tac gag ccg acg 384 
Thr Asp Tyr Val Phe Pro Gly Asp Gly Ala Arg Pro Tyr Glu Pro Thr 
115 120 125 

gac ccg acc ggg ccg cga teg gtc tac ggg cgc acc aag etc gaa ggc 432 
Asp Pro Thr Gly Pro Arg Ser Val Tyr Gly Arg Thr Lys Leu Glu Gly 
130 135 140 

gaa egg gec gtg ctg gag tec ggc gcg egg gee tgg gtg gtg cgc acg 4 80 
Glu Arg Ala Val Leu Glu Ser Gly Ala Arg Ala Trp val Val Arg Thr 
145 150 155 160 

gca tgg gtg tac ggc gcg age ggc aag aac ttc ctg aaa acg atg ate 528 
Ala Trp Val Tyr Gly Ala Ser Gly Lys Asn Phe Leu Lys Thr Met lie 
165 170 175 

cgc etc teg ggg gag cgc gac acg ctg tec gtt gtg gac gat cag ate 576 
Arg Leu Ser Gly Glu Arg Asp Thr Leu Ser Val Val Asp Asp Gin lie 
180 185 190 

ggc teg ccg act tgg gcg gcg gac ctg gcg age ggc ctg ctg gag ctg 624 
Gly Ser Pro Thr Trp Ala Ala Asp Leu Ala Ser Gly Leu Leu Glu Leu 
195 200 205 

gee gaa egg gtc gee gaa cgc cgt gga ccg gag cag aag gtg ctg cac 672 
Ala Glu Arg Val Ala Glu Arg Arg Gly Pro Glu Gin Lys Val Leu His 
210 215 220 

tgc acc aat tec ggc cag gtg acc tgg tac gag ttc gcg egg gcg ate 720 
Cys Thr Asn Ser Gly Gin Val Thr Trp Tyr Glu Phe Ala Arg Ala lie 
225 230 235 240 

ttc gcg gaa ttc ggc ctg gac gag aac cgc gtc cac ccg tgc acg acg 768 
Phe Ala Glu Phe Gly Leu Asp Glu Asn Arg Val His Pro Cys Thr Thr 
245 250 255 

gcg gac ttc ccc etc ccg gcg cac cgc ccg gee tac teg gtc ctg tec 816 
Ala Asp Phe Pro Leu Pro Ala His Arg Pro Ala Tyr Ser Val Leu Ser 
260 265 270 

gac gtg gcg tgg cga gag gcg ggc ctg acc ccg atg cgc acc tgg egg 864 
Asp val Ala Trp Arg Glu Ala Gly Leu Thr Pro Met Arg Thr Trp Arg 
275 280 285 
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gaa gcc ctg gcg gcg gcc ttc gag aaa gac ggc gaa acc etc cga acc 912 
Glu Ala Leu Ala Ala Ala Phe Glu Lys Asp Gly Glu Thr Leu Arg Thr 
290 295 300 

cgc tga 918 

Arg 

305 



<210> 55 
<211> 305 
<212> PRT 

<213> Saccharopolyspora spinosa 
<400> 55 

Met Ser Arg Leu Ala Val Leu Val Pro Gly Gly Arg Gly Gin Leu Gly 
15 10 15 

Ser Glu Leu Ala Arg He Leu Pro Ala Arg Thr Gly Ala Leu Val His 
20 25 30 

Arg Pro Gly Ser Gly Glu Leu Asp Val Thr Asp Ala Glu Glu Val Ala 
35 40 45 

Asp Ala Leu Gly Ser Phe Ala Glu Thr Ala Lys Asp Ala Glu Leu Arg 
50 55 60 

Pro Val Val He Asn Ala Ala Ala Tyr Thr Ala Val Asp Ala Ala Glu 
65 70 75 80 

Ser Asp Pro Asp Arg Ala Ala Arg He Asn Ala Glu Gly Ala Ala Ser 
85 90 95 

Leu Ala Lys Ala Cys Arg Ser Ser Gly Leu Pro Leu Val His Val Ser 
100 105 110 

Thr Asp Tyr Val Phe Pro Gly Asp Gly Ala Arg Pro Tyr Glu Pro Thr 
115 120 125 

Asp Pro Thr Gly Pro Arg Ser Val Tyr Gly Arg Thr Lys Leu Glu Gly 
130 135 140 

Glu Arg Ala Val Leu Glu Ser Gly Ala Arg Ala Trp Val Val Arg Thr 
145 150 155 160 

Ala Trp Val Tyr Gly Ala Ser Gly Lys Asn Phe Leu Lys Thr Met He 
165 170 175 
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Arg Leu Ser Gly Glu Arg Asp Thr Leu Ser Val Val Asp Asp Gin lie 
180 185 190 

Gly Ser Pro Thr Trp Ala Ala Asp Leu Ala Ser Gly Leu Leu Glu Leu 
195 200 205 

Ala Glu Arg Val Ala Glu Arg Arg Gly Pro Glu Gin Lys Val Leu His 
210 215 220 

Cys Thr Asn Ser Gly Gin Val Thr Trp Tyr Glu Phe Ala Arg Ala He 
225 230 235 240 

Phe Ala Glu Phe Gly Leu Asp Glu Asn Arg Val His Pro Cys Thr Thr 
245 250 255 

Ala Asp Phe Pro Leu Pro Ala His Arg Pro Ala Tyr Ser Val Leu Ser 
260 265 270 

Asp Val Ala Trp Arg Glu Ala Gly Leu Thr Pro Met Arg Thr Trp Arg 
275 280 285 

Glu Ala Leu Ala Ala Ala Phe Glu Lys Asp Gly Glu Thr Leu Arg Thr 
290 295 300 

Arg 
305 
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^ (54) Title: NUCLEIC ACIDS WHICH CODE FOR THE ENZYME ACTIVITIES OF THE SPINOSYN BIOSYNTHESIS 

g (54) Bezeichnung: NUCLE1NSAUREN, DIE FOR ENZYMAKTIVITATEN DER SPINOSYN-BIOSYNTHESE CODIEREN 

m 

(57) Abstract: The present invention relates to nucleic acids which code for the enzyme activities of the spinosyn biosynthesis. 
^ The invention also relates to the corresponding enzymes as such. The invention further relates to a method for producing spinosyn 
derivatives and spinosyn precursors. 

O (57) Zusammenfassung: Die vorliegende Erfindung betrifft Nucleinsauren, die fur Enzymaktivitaten der Spinosyn-Biosynthese co- 
J> dieren, sowie die entsprechenden Enzyme per se. Weiterhin betrifft die Erfindung Verfahren zum Herstellen von Spinosyn-Derivaten 
^ und -Vorstufen. 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This international searchreporthas not been established in respect of certain claims under Article 1 7(2)(a) forthe following reasons: 
1. I I Claims Nos.: 

' — ' because they relate to subject matter not required to be searched by this Authority, namely: 



2. r~| Claims Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. I] Claims Nos.: 

1 — 1 because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box 11 Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



1 . I I As all required additional search fees were timely paid by the applicant, this international search report covers all 

searchable claims. 

2 . As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3. I I As only some of the required additional search fees were timely paid by the applicant, this international search report 
1 — 1 covers only those claims for which fees were paid, specifically claims Nos. : 



4. No required additional search fees were timely paid by the applicant. Consequently, this international search report is 

I — ' restricted to the invention First mentioned in the claims; it is covered by claims Nos.: 

1-9, 11. 13-33, 63-69 (all in part) 



Remark on Protest Pj The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 
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1. Claims Nos. 1-9, 11, 13-33, 63-69 (all in part) 

Nucleic acid according to SEQ ID NO: 7, gene product which can be derived therefrom, 
vector comprising at least the nucleic acid according to SEQ ID NO: 7, host cell 
containing a nucleic acid according to SEQ ID NO: 7, antibody which specifically 
reacts with the gene product, and method for producing the nucleic acid, the polypeptide 
or spinosyn. 



2. Claims Nos. 1-9, 11, 13-31, 34, 35, 63-69 (all in part) 

Gene product like Invention No. 1, however, which can be derived for the nucleic acid 
according to SEQ ID NO: 9, and subject matter related thereto. 



3. Claims Nos. 1-72 (all in part and if applicable) - 24. Claim Nos. 1-72 (all in part and if 
applicable) 

Gene products like Invention No. 1 , however, which can be derived for the nucleic acid 
according to SEQ ID NO: 1 1 to SEQ ID NO: 54, and subject matter related thereto. 
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A. KLASSIRZIERUNG DES ANMELOUNGSGEGENSTANOES 

IPK 7 C12N15/52 C12N15/70 C12N1/21 C12N5/14 C07K14/195 
C12P19/62 

Nach der Intemationalen Patentklassffikation (IPK) Oder nach der nationalen Klassifikation und der IPK 



B. RECHERCHIERTE GEBIETE 



Recherchierter Mindestprufstoff (Wassifikations system und Klassifikationssymbole ) 

IPK 7 C07K C12N C12P 



Recherchterte aber nicht zum Mindestprflfstoff gehOrende VerSffenBichungen, soweit diese unter dta recherchierten Gebieta fallen 



Wahrend der Intemationalen Recherche konsultierte elektrontsche Datenbank (Name der Datenbank und evtl. verwendete Suchbegriffe) 



a ALS WESENTUCH ANGESEHENE UNTERLAGEN 



Kategorie" Bezeichnung der Veroffentiichung, soweit erfoidertich unter Angabe der in Betracht kommenden Teile 



Betr. AnspruchNr. 



KIRST, H.A. ETAL.,: "Discovery, 
Isolation, and Structure Elucidation of a 
Family of Structurally Unique, 
Fermentation-Derived Tetracyclic 
Macrolides." 

1992 , BAKER, D.R., FENYES, J.G. AND 
STEFFENS J.J. , AMERICAN CHEMICAL SOCIETY 
SYMPOSIUM SERIES NR. 504: SYNTHESIS AND 
CHEMISTRY OF AGROCHEMICALS III. WASHINGTON 
DC XP000944494 
Seite 214 -Seite 255 
das ganze Dokument 

-/-- 



1-6, 
13-17, 
19-32, 
63-69 



X I Weitere Verfiffentichungen sind der Fortsetzung von Feld C zu 
—J entnehmen 



|X I Siehe Anhang Patenttamilie 



° Besondere Kategorien von angegebenen VerSffentiichungen : 
'A" Veroffentiichung, die den aJIgemeinen Stand der Techrik dsfniert, 
aber nicht als besonders bedeutsam anzusehen ist 

"E" afteres Dokument. das jedoch erst am Oder nach dem intemationalen 
Anmeidedatum veroffentlicht worden ist 

Veroffentiichung, die geekjnet ist, einen Prioritatsanspruch zweifelhaft er- 
scheinen zu lassen, Oder durch die das Veroffentiichungsdatum einer 
anderen tm Recherchenbericht genartnten Veroffentiichung belegt werden 
soQ oder die aus einem anderen besonderen Grund angegeben ist (wie 
ausgefuhrt) 

"O" Verdffentlichung, die sich auf eine mOndliche Offenbarung, 

eine Benutzung, eine Ausstellung oder andere MaBnahmen bezieht 
P* Vereffentifchung, die vor dem intemationalen Anmeidedatum, aber nach 
dem beanspruchten Prioritatsdatum veroffentlicht worden ist 



T Spatere Veroffentiichung, die nach dem intemationalen Anmeidedatum 
oder dem Prioritatsdatum verfiffentiicht worden ist und mit der 
Anmeldung nicht kollidiert, sondem nur zum Verstandnis des der 
Erfindung zugrundeliegenden Prinzips oder der ihr zugrundeliegenden 
Theone angegeben ist 

"X* Veroffentiichung von besonderer Bedeutung; die beanspruchte Erfindung 
kann ailein aufgrund dieser Veroffentiichung nicht als neu oder auf 
erfinderischer Tatigkeit beruhend betrachtet werden 

"Y" Veroffentiichung von besonderer Bedeutung; die beanspruchte Erfindung 
kann nicht als auf erfinderischer Tatigkeit beruhend betrachtet 
werden, wenn die Veroffentiichung mit einer oder mehreren anderen 
VerOffentlichungen dieser Kategorie in Verbindung gebracht wird und 
diese Verbindung fur einen Fachmann naheliegend ist 
Veroffentiichung, die Mitglied derselben PatentfamiUe ist 



Datum des Abschlusses der intemationalen Recherche 

15. Dezember 20G0 


Absendedalum des intemationalen Recherchenberichts 
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Name und Postanschrift der Intemationalen Recherchenbehdrde 
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Tel. (+31 -70) 340-2040, Tx. 31 651 epo nl, 
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C<Fortsetzung) ALS WESENTUCH ANGESEHENE UNTERLAGEN 



Kategorie" Bezeichnung der VerOffentlichung, scweit erforderlich unter Angabe der in Betracht kommenden Teile 



Betr. Anspruch Nr. 



p,x 



MATSUSHIMA P ET AL: "Conjugal transfer of 
cosmid DNA from Escherichia col i to 
Saccharopolyspora spinosa: effects of 
chromosomal insertions on macrolide A83543 
production" 

GENE, NL, ELSEVIER BIOMEDICAL PRESS. 
AMSTERDAM , 

Bd. 146, Nr. 1, 1994, Seiten 39-45, 
XP002106258 

ISSN: 0378-1119 
das ganze Dokument 

H0PW00D D A: "Genetic contributions to 
understanding polyketid synthases" 
CHEMICAL REVIEWS, US, AMERICAN CHEMICAL 
SOCIETY. EASTON, 

Bd. 97, Nr. 7, November 1997 (1997-11), 
Seiten 2465-2497, XP002130647 
ISSN: 0009-2665 
das ganze Dokument 

BALTZ R H ET AL: "Molecular genetic 
methods for improving secondary-metabolite 
production in actinomycetes" 
TRENDS IN BIOTECHNOLOGY, GB, ELSEVIER 
PUBLICATIONS, CAMBRIDGE, 
Bd. 14, Nr. 7, 1. Juli 1996 (1996-07-01), 
Seiten 245-250, XP004035763 
ISSN: 0167-7799 
das ganze Dokument 

WO 99 46387 A (BROUGHTON M CHRISTINE 
;MADDURI KRISHNAMURTHY (US); WALDRON CLIVE 
() 16. September 1999 (1999-09-16) 
das ganze Dokument 



1-9,11, 

13-33, 

63-69 



1-9,11, 

13-33, 

63-69 
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INTERNATIONALER RECHERCHENBERICHT 



Ink .donates Aktenzeichen 

PCT/EP GG/08013 



FekJ I Bemerkungen zu den Anspruchen, die sich als nicht recherchierbar erwiesen haben (Fortseteung von Punkt 2 auf Blatl 



GemSB Artikel 1 7(2)a) wurde aus folgenden GrOnden fflr bestimmte AnsprOche kein Recherchenbericht ersteffc 
1. | | AnsprOche Nr. 

weil sie sich auf Gegenstande beziehen, zu deren Recherche die Behorde nicht verpffichtet ist, namlich 



2. [^J AnsprOche Nr. 

weil sie sich auf Teile der rnternationalen Anmeldung beziehen, die den vorgeschriebenen Anforderungen so wenig entsprechen 
daS eine sinnvolle intemationale Recherche nicht durchgefOhrt werden kann, namlich " 



3. AnsprOche Nr. 

weil es sich dabei um abhangige AnsprOche handelt, die nicht entsprechend Satz 2 und 3 der Regel 6.4 a) abgefaBt sind. 



Feld II Bemerkungen bei mangelnder Einheitlichkeit der Erfindung (Fortsetzung von Punkt 3 auf Blatt 1 ) 



Die intemationale Recherchenbehdrde hat festgesteltt, daB diese intemationale Anmeldung mehrere Erfindungen enthaJt: 



1 • I I Da der Anmelder aile erforderfichen zusatziichen RecherchengebOhren rechtzeitig entrichtet hat, erstreckt sich dieser 
1 — 1 intemationaJe Recherchenbericht auf aJle recherchierbaren AnsprOche. 

2. I | Da fur alle recherchieroaren AnsprOche die Recherche ohneeinen Arbertsaufwand durchgefOhrt werden konnte der eine 
— zusatzJiche Recherchengebuhr gerechtfertigt hatte, hat die BehSrde nicht zur Zahfung einer solchen GebOhr aufgefordert. 



3. f - ] Da der Anmelder nur einige der erforderlichen zusatziichen RecherchengebOhren rechtzeitig entrichtet hat, erstreckt sich dieser 
— intemationale Recherchenbericht nur auf die AnsprOche, fOr die GebOhren entrichtet worden sind, namlich auf die 
AnsprOche Nr. 



4 - [)C-I Der An melder °at die erforderlichen zusatziichen RecherchengebOhren nicht rechtzeitig entrichtet. Der intemationale Recher- 
Mt beschrankt sich daher auf die in den AnsprOchen zuerst erwahnte Erfindung; diese ist in folgenden Anspruchen er- 

1-9, 11 , 13-33, 63-69 alle parti ell 



Bemerkungen hinsichtlich eines Widerspruchs |~] Die zusatziichen GebOhren wurden vom Anmelder unter Widerspruch gezahlt. 

[ [ Die Zahlung zusatzlicher RecherchengebOhren erfolgte ohne Widerspruch. 
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darauf beziehen. 



3. Anspruche: 1-72 (alle partiell und sofern zutreffend) 
24. Anspruche: 1- 
72 (alle partiell und sofern zutreffend) 



Wie Erfindung 1 aber fiir Nucleinsaure gemass SEQ ID N0:11 
bis SEQ ID NO: 54, davon ableitbare Genprodukte und 
Gegenstande welche sich darauf beziehen. 
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